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TITLE 

AUTOMATIC MAPPING FROM DATA TO PREPROCESSING ALGORITHMS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims the benefit of U.S. Provisional Application No. 60/274,008, 

filed March 7, 2001. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 
[0002] Part of the funding for research leading to this invention may have been provided 

under federal government contract number 30018-71 15, "ONR Algorithm Toolbox Development." 

REFERENCE TO COMPUTER PROGRAM LISTING APPENDIX 
[0003] This application includes a computer program appendix listing (in compliance with 

37 C.F.R. § 1.96) containing source code for a prototype of an embodiment. The computer program 
appendix listing is submitted herewith on one original and one duplicate compact disc (in 
compliance with 37 C.F.R. § 1.52(e)) designated respectively as Copy 1 and Copy 2 and labeled in 
comphancewith37C.F.R. § 1.52(e)(6). 

[0004] All the material in this computer program appendix listing on compact disc is 

hereby incorporated herein by reference, and identified by the following table of file names, 
creation/modification date, and size in bytes: 

CREATED/ SIZES IN 
NAMES OF FILES MODIFIED BYTES 

DMS\date_convert.c 18-Jun-01 12,557 

DMS\date_convert_mex.c 18-Jun-01 6,971 

DMS\determine_field_type.c 18-Jun-01 13,005 

DMS\determine_field_type_mex.c 18-Jun-01 4,061 

DMS\read_ascii_mix2.c 18-Jun-01 41,256 

DMS\read_ascii_mix2_mex.c 18-Jun-01 30,728 

DMS\read_palm.c 18-Jun-01 20,553 
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DMS\read_palm_mex.c 


18-Jun-Ol 


12,332 


DMS\date_convert.h 


18-Jun-Ol 


1,135 


DMS\datenum.h 


18-Jun-Ol 


1,080 


DMS\determine_field_type.h 


18-Jun-Ol 


1,076 


DMS\fgetl.h 


18-Jun-Ol 


841 


DMS\find_break.h 


18-Jun-Ol 


1,064 


DMS\fmd_date_field2.h 


18-Jun-Ol 


1,024 


DMS\find_mos.h 


18-Jun-Ol 


898 


DMS\isalpha.h 


18-Jun-Ol 


859 


DMS\mod.h 


18-Jun-Ol 


844 


DMS\read_ascii_mix2.h 


18-Jun-Ol 


1,414 


DMS\read_palm.h 


18-Jun-Ol 


1,300 


DMS\sec.h 


18-Jun-Ol 


831 


DMS\std.h 


18-Jun-Ol 


867 


DMS\str2num.h 


18-Jun-Ol 


853 


DMS\strvcat.h 


18-Jun-Ol 


884 


DMS\addonp.m 


26-Jun-01 


6,013 


DMS\addonrp.m 


26-Jun-01 


4,518 


DMS\adjust_barr.m 


17-May-01 


373 


DMS\adjust_barrr.m 


17-May-01 


377 


DMS\align_time.m 


19-Jun-01 


373 


DMS\all_inf.m 


26-Jan-01 


793 


DMS\arcovp.m 


25-Jun-01 


797 


DMS\auto_input_select.m 


26-Jan-01 


1,165 


DMS\auto_select_input.m 


2-Jul-01 


1,711 


DMS\b_read.m 


18-Aug-00 


4,813 


DMS\batch_kdd.m 


10-May-Ol 


46,083 


DMS\batch_palm.m 


ll-May-01 


45,855 


DMS\binconv.m 


ll-Jun-01 


114 


DMS\blind_test.m 


12-Jun-01 


4,778 


DMS\blindblind.m 


12-Jul-01 


3,446 


DMS\bnn_act_bk.m 


6-Mar-01 


10,800 


DMS\bvarr.m 


9-Jul-01 


4,797 


DMS\candlestick.m 


24-Aug-00 


177 


DMS\cat_string_field.m 


10-May-01 


662 


DMS\catcell.m 


31-May-01 


149 


DMS\cell2num.m 


l-Nov-99 


447 


DMS\clas_discrete_combine.m 


26-Jun-01 


5,487 


DMS\collagen.m 


14-Aug-00 


2,693 


DMS\compile_results.m 


23-Apr-01 


5,478 


DMS\compile_results_m.m 


23-Apr-01 


4,915 


DMS\concatstr.m 


4-Jun-01 


108 
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DMS\convert_wk2mo.m 


ll-May-01 


755 


DMS\convertAtoB .m 


21-May-01 


684 


DMS\convertYmd2Date.m 


19-Jun-01 


332 


DMS\corr_coeff.m 


26-Jan-01 


1,168 


DMS\corr_rank.m 


16-Jun-01 


316 


DMS\create_thrombo_metadata.m 


17-May-01 


1,517 


DMS\csv2strv.m 


16-May-01 


341 


DMS\ctb_hist2.m 


24-May-Ol 


2,179 


DMS\dataload.m 


23-Apr-01 


7,962 


DMS\dataload_m.m 


23-Apr-01 


7 5 810 


DMS\dataload2.m 


26-Jan-01 


2,056 


DMS\dataload2_m.m 


26-Jan-01 


2,403 


DMS\date_convert.m 


18-Jun-Ol 


519 


DMS\date_display.m 


12-Aug-00 


150 


DMS\date_interval.m 


12-Jun-01 


556 


DMS\DCT_feat.m 


ll-Jun-01 


452 


DMS\decimate_scatter.m 


30-May-Ol 


1,648 


DMS\decode_answer.m 


16-Jun-01 


218 


DMS\delete_figures.m 


23-Apr-01 


1,086 


DMS\detailed_results.m 


10-May-Ol 


4,990 


DMS\determine_catord.m 


20-Sep-00 


249 


DMS\determine_field_type.m 


2-Jul-Ol 


931 


DMS\deunderscore.m 


27-May-01 


175 


DMS\dimension_reduction.m 


ll-Jun-01 


1,525 


DMS\dimension_reductionS.m 


7-Jun-01 


1,402 


DMS\display_example.m 


10-May-Ol 


260 


DMS\dm_batch.m 


30-Jun-Ol 


2,866 


DMS\dm_expert.m 


ll-May-01 


191 


DMS\dm_expert_gui.m 


12-Jul-01 


11,194 


DMS\dm_expert_part.m 


12-May-01 


1,716 


DMS\dm_expert_ran.m 


12-Jul-Ol 


8,208 


DMS\DM_recommend.m 


8-Jun-Ol 


4,718 


DMS\dmr_expert_gui.m 


22-Jun-01 


8,360 


DMS\dmr_expert_part.m 


29-Jun-01 


2,736 


DMS\dmr_expert_ran.m 


2-Jul-01 


7,441 


DMS\dms_dataload.m 


23-Apr-01 


317 


DMS\dms_demo.m 


23-Apr-01 


1,975 


DMS\dms_main.m 


26-Jun-01 


6,159 


DMS\dms_params.m 


12-Jul-Ol 


4,048 


DMSVDWT.m 


16-Jun-01 


578 


DMS\elim_article.m 


29-Jan-01 


586 


DMS\embed_sm.m 


10-Nov-OO 


282 
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DMS\embed_smooth.m 

DMS\enco.m 

DMS\energy_compact.m 

DMS\exl_getmat.m 

DMS\exl_setmat.m 

DMS\explain_candle.m 

DMS\explain_llr.m 

DMS\explain_oc.m 

DMS\explain_pdf.m 

DMS\explain_pfi.m 

DMS\explain_scat.m 

DMS\explore_macro.m 

DMS\explore_ts.m 

DMS\explorelD.m 

DMS\extract_time_feat.m 

DMS\feature_rank.m 

DMS\find_break.m 

DMS\find_comma.m 

DMS\find_date_field.m 

DMS\find_date_field2.m 

DMS\find_drug_feat.m 

DMS\find_drug_feat2.m 

DMS\fmd_field.m 

DMS\fmd_future.m 

DMS\findJp.m 

DMS\find_mos.m 

DMS\find_var_zero.m 

DMS\fm_clean.m 

DMS\fin_prep.m 

DMS\formatTime.m 

DMS\frank_rank.m 

DMS\FromGT.m 

DMS\FromInputl .m 

DMS\FromInput2.m 

DMS\FromOutput.m 

DMS\FromSegment.m 

DMS\FromSegment2.m 

DMS\FromTime.m 

DMS\gen_dcrm.m 

DMS\gen_dcrm2.m 

DMS\gen_dcrm3.m 

DMS\gen_mog_metadata.m 



21-May-01 205 

19-Feb-01 350 

5-Jun-01 822 

l-Nov-99 2,681 

l-Nov-99 4,084 

28-Aug-00 716 

23-Apr-01 533 

23-Apr-01 413 

28-Aug-00 413 

23-Apr-01 641 

28-Aug-00 454 



22-Jun-01 


2,551 


22-Jun-01 


2,141 


26-Jun-01 


6,058 


29-Jun-01 


1,171 


ll-Jun-01 


464 


14-Jun-01 


537 


19-Jun-01 


380 


15-Jun-01 


151 


20-Jun-01 


248 


14-Aug-00 


918 


26-Aug-00 


1,019 


26-Jun-01 


3,235 


29-Jun-01 


121 


15-May-01 


321 


20-Jun-01 


203 


26-Jun-01 


286 


25-Aug-00 


1,129 


26-Aug-00 


153 


19-Jun-01 


586 


31-May-01 


270 


4-Jun-01 


240 


9-May-01 


54 


16-Jun-01 


298 


16-Jun-01 


377 


25-May-01 


270 


18-Jun-01 


248 


13-Jun-01 


159 


13-Jun-01 


1,366 


13-Jun-01 


1,382 


14-Jun-01 


912 


21-May-01 


283 
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DMS\generate_lift_pdf.m 


12-Jun-01 


2,379 


DMS\genPalmTS,m 


19-Jun-01 


1,748 


DMS\get__boundary.m 


6-Jun-01 


635 


DMS\get__metadata.m 


12-Jul-01 


8,730 


DMSXginput proem 


19-Jun-01 


271 


DMS\glm_actJ>k.m 


6-Mar-01 


0 


DMS\global_„var.m 


ll-May-01 


841 


DMS\gmm_act__bk.m 


6-Mar-01 


11,021 


DMS\ground_truth.m 


4-Jun-Ol 


1,597 


DMS\gt_process 1 .m 


4-Jun-01 


1,085 


DMS\gt_show_choice.m 


4-Jun-01 


538 


DMS\gt__truth.m 


4-Jun-01 


1,697 


DMS\input_help .m 


23-Apr-01 


3,153 


DMS\input_help_m.m 


24-Apr-01 


4,536 


DMS\insert2Time.m 


19-Jun-01 


704 


DMS\io_help.m 


24-Jun-01 


4,404 


DMS\k_errorbar.m 


25-Aug-00 


3,400 


DMS\kdd_sysparam.m 


26-Aug-00 


328 


DMS\knn_act_bk.m 


6-Mar-01 


11,314 


DMSVks repress m 


20-Jun-01 


322 


DMS\lala_redux.m 


ll-Jun-01 


1,590 


DMSUfc act bkm 


6-Mar-01 


10,641 


DTVTSMn nrediet m 


25-Jun-01 


439 


DMS\lp_predict_bt.m 


26-Jun-01 


556 


DMS\to t>redict2m 


25-Jun-01 


237 


DMS\1r>c nred m 


26-Jun-01 


339 


DMSMsvm m 


27-Jun-01 


518 


DMS\main kdd2001 m 


6-Jun-01 


145 


DMS\main palm.m 


27-May-01 


294 


DMS\main uci tin 

X / It X. k> \XJ.XX-VXAX Lt-VX »XJ.X 


5-Jun-01 


17,394 


DMS\rnakeiteven im 

J-^J,VXkJ \lIltilVvi IV/ V Cxi.*xAJ. 


23-Aug-00 


757 


DMS\m aster homcea.m 


26-Jan-01 


1,926 


DM S\m aster hotneew tn 


26-Jan-01 


1,847 


THVTSl\ma<?ter kdd m 


20-Feb-01 


2,100 


DMS\master_mail.m 


26-Jan-01 


1^929 


DMS\max_matrix.m 


27-Aug-00 


164 


DMS\max_matrixr.m 


31-May-01 


323 


DMS\mean_ks.m 


14-Jun-01 


59 


DMS\median_norm.m 


4-Jun-01 


497 


DMS\merge_clas.m 


6-Jun-01 


427 


DMS\merge_tables.m 


29-Nov-OO 


6,357 


DMS\metadata_list.m 


20-Jun-01 


2,050 
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DMS\mlp_act_bk.m 

DMS\mm_kdd.m 

DMS\mom_rank.m 

DMS\more_results.m 

DMS\more_results_r2.m 

DMS\more_results2.m 

DMS\mssk_est.m 

DMSVmsskk.m 

DMS\multi_table.m 

DMS\mvg_act_bk.m 

DMS\tinc_act_bk.m 

DMS\norm_reg.m 

DMS\one_inb.m 

DMS\one_inf.m 

DMS\one_inf_m.m 

DMS\one_outb.m 

DMS\one_outf.m 

DMS\one_outf_m.m 

DMS\outlier_det.m 

DMS\outlier_det_pert.m 

DMS\own_process.m 

DMS\palm_customer_mapping.m 

DMS\Palm_customer_match.m 
DMS\palm_derive_fields.m 

DMS\palm_events.m 

DMS\Palm_product_sales.m 

DMS\palm_time_series_fields.m 

DMS\palm_time_series_fields2.m 

DMS\PalmAllS_postprocess.m 

DMS\PC_tradeoff.m 

DMS\pca_feat.m 

DMS\pfapd.m 

DMS\pl_fx-m 

DMS\pl_reset.m 

DMS\pl_run.m 

DMS\pl_zoom.m 

DMS\playwithfin.m 

DMS\plot_time_series.m 

DMS\pnn_act_bk.m 

DMS\prep_dm.m 

DMS\prep_macro_econ.m 

DMS\prepare_data2.m 

-6 



6-Mar-01 10,650 

24-Jan-01 624 

29-May-01 507 

10-May-Ol 5,111 

20-Jun-01 3,450 



^A Tii« A1 




C Tn« A1 


417 


3-jun-ui 


Q41 


18-Aug-0U 




ll-May-0l 




6-Mar-0l 


1 a Aon 


3l-May-0l 




26-Jan-0l 




29-Aug-00 


1 £GA 


20-Sep-00 




22-Aug-00 


D /I 


19-Aug-00 


1 1 

1,221 


20-Sep-00 


4y3 


28-May-01 


JO / 


30-May-01 


81/ 


24-Jun-0l 


TIC 


15-Jun-Ol 




18-Jun-Ol 


1,264 


20-Jun-0l 


2,539 


l2-Jun-0l 


2,359 


17-Jun-Ol 


626 


15-Jun-Ol 


1,233 


18-Jun-Ol 


2,114 


19-Jun-Ol 


TOO 

52.1 


5-Jun-0l 


594 


2l-May-0l 


158 


29-Aug-00 


384 


22-Aug-00 


354 


22-Aug-00 


81 


22-Aug-00 


1,414 


22-Aug-00 


1,218 


22-Aug-00 


2,505 


l9-Jun-0l 


4,156 


6-Mar-0l 


11,124 


20-Sep-00 


129 


ll-May-Ol 


1,230 


24-Jan-0l 


5,185 
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DMS\prepare_data3 .m 

DMS\rbf_act_bk.m 

DMS\read_ascii_mix.m 

DMS\read„ascii_mix2.m 

DMS\read_ascii_mix3,m 

DMS\read_ascii_mix5 .m 

DMS\read_fred_mos.m 

DMS\read_free_wkly.m 

DMS\read_mailing.m 

DMS\read_names.m 

DMS\read_palm.m 

DMS\read_time_samples.m 

DMS\read_uci.m 

DMS\read _yeastm 

DMS\remove_outlier.m 

DMS\reset_inoutm 

DMS\reset_io.m 

DMS\resetTime.m 

DMS\resolve_customer_ambiguity.m 

DMS\run_dm.m 

DMS\run_dm_master.m 

DMS\run_now.m 

DMS\saveTime.m 

DMS\select_input_m.m 

DMS\setdiff_unsort.m 

DMS\show_croc.m 

DMS\show_or_hide.m 

DMS\show_orJiide_reg.m 

DMS\show_pdfns.m 

DMS\show„percentile.m 

DMS\showfeatm 

DMS\showfeatPDF.m 

DMS\showfeatPDFr.m 

DMS\sort_str.m 

DMS\str2datenum.m 

DMS\str2strs.m 

DMSVstrchop.m 

DMS\strmatchfuzz.m 

DMS\strmf.m 

DMSVstrvcmp.m 

DMS\subgroup_segmentm 

DMS\svd_fill_missing.m 

-7- 



1 ^ T 1 A 1 

12-Jul-01 




6-Mar-O I 


1U, / lo 


15-Jun-01 


ri "5 AO 

2,393 


21-Jun-01 


3,316 


1 /■" T A1 

16-Jun-01 


Z ? !>Z4 


If) T _ f\ 1 

18-Jun-01 


0 A TO 

z,4/y 


10-May-01 


845 


10-May-01 


1,133 


20-Sep-00 


364 


5-Jun-01 


253 


18-Jun-01 


1,269 


23-Apr-01 


6,413 


24-May-01 


o 1 

816 


21-Jun-01 


171 


27-Aug-00 


345 


29-Nov-OO 


223 


22-Jun-01 


513 


13-Jun-01 


66 


18-Jun-01 


731 


2-Jul-Ol 


1,336 


31-May-01 


348 


28-Aug-00 


639 


19-Jun-01 


458 


23-Apr-01 


428 


17-May-01 


220 


21-May-01 


381 


ll-May-01 


249 


31-May-01 


255 


21-May-01 


446 


14-Jun-01 


463 


10-May-01 


3,259 


20-Jun-01 


A SCO 

4,658 


16-Jun-01 


1,267 


l-Jun-01 


424 


27-May-01 


204 


8-May-01 


1,130 


10-May-01 


190 


4-Jun-01 


564 


10-May-01 


716 


10-May-01 


271 


20-Jun-01 


1,702 


24-May-01 


1,019 
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DMS\svd_helpm.m 


l-Jun-01 


112 




DMS\svd_te_helpm.m 


l-Jun-01 


195 




DMS\svd_ter.m 


l-Jun-01 


2,328 




DMS\svm_act_bk.m 


6-Mar-01 


11,003 




DMS\TBFVE.m 


25-Jun-01 


925 




DMS\test_bvar.m 


10-Jul-01 


555 




DMS\test Isvm.m 


27-Jun-01 


413 




DMS\test_makeiteven.m 


23-Aug-00 


166 




DMS\test_own.m 


24-Jun-01 


140 




DMS\test_svd_te_help.m 


l-Jun-01 


353 




DMS\testBlind.m 


12-Jul-01 


1,309 




DMS\time_fe.m 


1 l-Jun-01 


844 




DMS\time_feat_extm 


22-Jun-01 


1,191 




DMS\titrie &n\ m 

lyivXk/ ItilXAV *kl**X*JlJ.JL 


19-Jun-01 


3,590 




DMS\ToGT m 


4-Jun-01 


897 


^2 


DMS\ToInputl.m 


2-Jul-01 


450 


ESS-" 


DMS\ToInput2.m 


19-Jun-01 


450 


£ |» 


DMS\ToOutputm 


29-Jun-01 


2,379 




DMSYToSegment m 


5-Jul-01 


1,661 


1=1 


DMS\ToTime m 

1/lVlU \ lul 1111 V/. Ill 


13-Jun-01 


111 


13 


DMSlWci trend in 

J-yiVlU \ V VI Li. Vxxvi.xll 


15-Mav-01 


437 




DMS\wherc_is_fhe_beef2 .m 


12-Jul-01 


3,432 




DMS\ where is the beefr2 m 


25-Jun-01 


3,134 




DMSWhy selection.m 


l-Jun-01 


2,324 


H- 


DMSWhyr selection.m 


31-May-01 


752 




DM S\zeropad.m 


24-Jun-01 


388 




F)lVrS\700fnk"s m 

irf'J.VXk!/ VllliViJ .ill 


12-Aue-00 


14,211 




DlVrS\7r»fimrot tin 

_L/J.V_I_k_> \Xjv7VJ11J 1 V./ U .111 


21-Mav-01 


803 




DMSYREADME txt 


13-Jul-01 

1 ^ »l vll V 1 


429 






12- Tu 1-01 


12 272 




OSP\d<!r>piTor m 

i/|Jl \UOUvXlv/ltill 


22-Jun-OO 


1,842 




jl^lja vvjoJ-'i v/Cituti V' .111 


5-Jul-OO 


4,225 




T^^H-Ancnorn tn 




31 515 




rKP\fkr>1n m 
i/or \U.v>piU.IIl 


12- Tn 1-01 

X£* J til V 1 


1 291 




DSP\EIH.m 


5-Jul-00 


4,021 




DSP\err.m 


22-Jun-00 


580 




DSP\feature_vis.m 


4-Jul-00 


3,248 




DSP\fieldsave.m 


28-Jun-00 


1,607 




DSP\fieldsave_fig.m 


5-Jul-00 


2,240 




DSP\fieldsel.m 


4-Jul-00 


1,423 




DSP\fieldsel_fig.m 


5-Jul-00 


2,746 




DSP\finsel.m 


5-Jul-00 


3,621 
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u or uiju&ci iig.iii 


5-Tul-OO 

nj J UX \J\J 


10 234 


i^SlPiiVhfWPtriJiti m 

IVLJl \lJllclo wlluiJ .111 


5-Tul-00 

•J J 111 yJ\J 


1 54S 


DSP\sneo m^nu m 


12-Jul-01 


2 ? 674 


n^PWatiiQ tn 

iyOr \oLCllliO.Ill 


1?-Tu1-01 

1 J WY \j 1 






5-Tul-OO 


268 


DSP\tfr menu m 

iVvjl \IXX lllt/llU.,111 


12-Iul-Oi 


9 95R 

7 ? 7^ O 


J70r MllL'W 111.111 


22- Tun -00 


4 464 


DSPYTFRSTFT mM 


22-Jun-00 

*J 1*4-1.1 V/ V/ 


2,759 


1PARP\README 


23-Jun-94 


838 


TPAT?P\iirlrlPp*cirhi;*1c r 


9^ Tn! 01 


Z1,JJ7 


TP APP\fiHfiP PCidn^W tnPY f* 
Ix / vIYl lliCA.V 


26-Tu1-0l 

j til vy x 


1 233 


ii rvivi vdviviivv o i V-I Lien o V/ . v/ 


X" X CU ul 


3 75S 


tpapp\amfr^a r 


^X"jrcu yo 


4 835 


TPARP\AMC>T<nA C 


1Q-Fph-QR 


84? 


TPARP\ann c 


7-Dec-Q7 


6 21 R 


TPAl?P\flwi tpQt p 


H-Anr-QQ 




TPAPP\fmrl nfMoKhrw p 


1 S-Anr-OQ 

1 J Apl 77 


78Q 

/ U7 


TP A PP\frn norm p 
lx r\JtvJr Mill UUflll.C 


IS Tn1-QQ 


647 


TPAPPVhict nVm r 
lx t\i\r uli o l_JxUii . i> 


X^?"Jdll"V/ X 


1 507 


TPAPP\Tii<itp p 

lx ZTLXVl U11SIV.V 


1 S-Anr-QQ 

1 J / VjJi 77 


1 ?46 


TPAT?P\knti c 

11 / V1V1 VIVlllU V- 


XU X CU ul 


14 41? 


TPARPWrtn mpif c 


16-Feb-01 


3 740 


TPAPPMnmr r 




2 509 


TPARP\martFva1 c 

1 1 / vlVT ullCtlLX-/VCll.L* 


26-Tul-Ol 

J IX I V/ X 


8 231 


TP APP\triarfP^cil tn ay p 
xx/\xvr viTlariiZ- Vdl IllCXX 


Tn 1-01 


5 ^Q^ 


TPATCPWiflTlFvalC r 


21-Feb-Ol 

^ x x v/iy w x 


5,010 


TPAPPVmrlp p 


X^ /vjjl 7 7 


2 149 


TPAPP\tn1n p 

lx /AJvx Uilip.w 


16-Feh-01 

X\J X CU VII 


16 484 


ir^y\tvr \niip_incX.c 


1A-F^V\ m 




TP APP\m1rpor p 


?0-Tnn-01 

Z»U J till V/ 1 


17 050 


TP A P P\tnl tv*aT thpy p 


20- Tun -01 

ZA/ J 1X11 \J X 


6 208 


TP APPXtip-iahlvYr <ihflt*p p 
ii .rvivr uicigiiuvji oiiciit/»i/ 


13-Jul-99 

X«^ J UJ 7 7 


1 393 


TPAPPVnnp p 




2 37? 


TPARP\nnminalSn1.tr c 

IX ZTJLVJ. UltALlllll«li*Jl-'llLV_>.V' 


20-Feb-01 


3,842 


IPARP\nominalSplitC_mex.c 


26-Jul-01 


1,378 


IPARP\nominalSplitC_mex_interface.c 


26-Jul-01 


5,361 


IPARP\Numcat.c 


13-Dec-98 


28,979 


IPARP\numericSplitC.c 


20-Feb-01 


2,597 


IPARP\obj_finder.c 


15-Apr-99 


1,072 


IPARP\pnn.c 


15-Apr-99 


2,861 


IPARP\pnn2.c 


17-Oct-OO 


2,785 


IPARP\pnn3.c 


17-Oct-00 


2,826 
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TPARPYRAN1 C 

IX ilivi \XVZil > 1 « Vy 


19-Feb-98 


896 


IPARP\RANDOM C 


31-Mar-98 


2,476 


IPARPVranord.c 


15-Apr-99 


943 


IPARP\rbfc 


16-Feb-01 


12,762 


IPARP\rbf_mex.c 


16-Feb-01 


3,864 


TPARP\Relax c 

11 x H VI UVvltlA. V 


30-Mar-98 


9,089 


TPARP\Rer>1ace c 


18-Jul-98 

x \j o i*i y \j 


16,348 


TP ARP\<iptVa1np«iFrntYiRp<;idnfi1^ c 

JLL jfiXVl \i>t'L V Cll 11^ Ol HJllllv^OlvAU-Cllo*^ 


26-Jul-01 


12,710 


TPARP\setValuesFromResiduals mex c 


26-Jul-01 


3,947 


TP A R P\ optVal 1 1 f^FrntnR pqi Hii al r 


19-Feb-01 

1 y L vU'v 1 


3 772 


TPARPWmifi^h r 
lx rvivi \&uticioii.\^ 


i o J U-i .7 0 


3 665 


IPARP\StateSDace c 


24-Nov-98 


19,359 


TPAPPWtflte^tv^p r> 
ix i^vivr \o utico|-/ctvC .c 


1 R-Tul-98 


21 924 


IPARP\Stats c 


24-Nov-98 


4,320 


IPARPVSTwrite c 


21-Set>98 


2,228 


TPARPWvH fp r 




22 312 


TPAPP\«WI tf* VieOr* r 


14-Tu1-99 


1 100 


TPAPPWwl fp> mf*Y o 


4^1 ~J Ull V/l 


15 51? 

1 wJ, *J 1Z< 


IPARP\Tred2 c 


22-Feb-98 


3,562 


TP A P P\Trimcmnl n 


94-NTrw-QS 


^ 4.1 0 


TPARP\TTti1 r 




1 1 359 

1 1 ^jjy 


TPARPWn r 


2S-Aup-99 


12 414 


TPAPPWni r> 

lr /xivr wqi. C 




1? 101 


TPARP\WrtCr c 


24-Nov-98 


3 369 


TP APP\WrfPctrmc r> 


x y j in yo 


4 467 


TPAPP\WrfPTP r 
lr rVtvr \ W I Ixx-D.U 


1\U V 70 




TP APP\Wt*fPt*Ar\ r> 

ir /vtvr \ w ixr rep . C 


Zf*T i>i u v y o 


1 1 


TPAPPYWrf^ltat r 
lr rVxvT WVHo lau t 


ZrT- rN u v y o 


? 173 


TP APP\arMT?PciHn*a1c Ti 




1 14? 


ir AKx \QCTeiJiiine_iiciG_iype .n 


71 Tnn 01 


1 071 


TPARP\rii\t? h 


16-Feb-01 

X\J X W v V X 


846 


TPARPYDn h 

XX -TV XVI VL/Lf.lI 


24-Nov-98 


15,666 


TP APP\icetrnr»t h 
lr Jajxt UooU UvLll 


16-Feb-01 


R54 


IrAKJ'MCtxtl.n 


itvreD-Ux 




IPARP\martEvaLh 


26-Jul-01 


966 


IPARP\martEvalC_mex_interface,h 


26-Jul-01 


1,175 


IPARP\mean.h 


21-Jun-01 


844 


IPARP\median.h 


26-Jul-01 


874 


IPARP\mlp.h 


16-Feb-01 


1,030 


IPARP\mlregr.h 


20-Jun-Ol 


1,163 


IPARP\nominalSplitC_mex_interface,h 


26-Jul-01 


1,300 


IPARP\NRUTIL.H 


7-Dec-96 


3,431 
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IPARP\rbf.h 


16-Feb-01 


947 


IPARP\rbfunpak.h 


16-Feb-01 


872 


IPARP\setValuesFromResiduals.h 


26-Jul-Ol 


1,224 


IPARP\svd_te.h 


21-Jun-01 


1,161 


IPARP\svd te help.h 


21-Jun-01 


988 


IPARP\svd_teJielpm,h 


21-Jun-01 


1,001 


IPARP\trace.h 


21-Jun-01 


836 


IPARP\access2fin.m 


25-May-01 


881 


IPARP\ACTIVLEV.M 


12-May-98 


6,174 


IPARPVaddon.m 


26-Jul-Ol 


5,992 


IPARP\addon_b.m 


19-Oct-00 


4,436 


IPARP\addonJl.m 


19-Oct-00 


2,308 


IPARPVaddonr.m 


3-Apr-01 


4,604 


IPARP\addResiduals,m 


16-Feb-01 


1,070 


IPARPVadjustkLm 


13-Jul-99 


1,255 


IPARP\axnp__statm 


13-Jul-99 


1,314 


IPARP\arbshow.m 


12-Dec-00 


3,614 


IPARP\assign_tgtm 


25-Apr-01 


4,870 


IPARP\auvq.m 


12-Jul-99 


4,348 


IPARP\averageNodeOutput.m 


21-Feb-01 


272 


IPARP\avq.m 


19-Oct-00 


5,228 


IPARP\avq_actm 


6-Mar-01 


12,439 


IPARP\avq__dlg.m 


19-Oct-00 


3,343 


IPARP\bl0to2.m 


23-Jun-94 


941 


IPARP\backward.m 


14-Jul-99 


1,194 


IPARP\barxy.m 


6-Dec-00 


7,551 


IPARP\batch_dlg.m 


3-Sep-99 


1,487 


IPARP\batch2_dlg.m 


19-Oct-00 


30,339 


IPARP\batta.m 


19-Oct-00 


2,228 


IPARP\Betap.m 


2-Aug-98 


470 


IPARP\Betaq.m 


2-Aug-98 


920 


IPARP\Betar.m 


2-Aug-98 


366 


IPARP\Binomt3 m 


28-Jul-99 


497 


IP ARP\Binomr.m 


28-Jul-99 


390 


IPARP\bn_infer.m 


16-May-00 


1,082 


IPARP\bn_train.m 


16-May-00 


1,948 


IPARP\bnc_after_infer.m 


l-Jun-00 


938 


IPARP\bnc_infer.m 


12-Jun-00 


833 


IPARP\bnc_process.m 


12-Jun-OO 


1,925 


IPARP\bnc_run_infer.m 


l-Jun-00 


1,141 


IPARP\bnc_train.m 


19-May-00 


1,625 


IPARP\bnc_train2.m 


31-May-00 


1,515 
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IPARP\bncm_infer.m 


12-Jun-00 


1,549 


IPARP\bncm_process.m 


12-Jun-00 


667 


IPARP\bnd_infer.m 


12-Jun-00 


1,119 


IPARP\bnd_process.m 


20-Jun-00 


2,524 


IPARP\bnd_run_infer.m 


12-Jun-00 


1,510 


IPARP\bndm_infer.m 


12-Jun-00 


2,075 


IPARP\bndm_process.m 


12-Jun-OO 


517 


IPARP\bnh_after_infer.m 


12-Jun-OO 


1,326 


IPARP\bnh_infer.m 


12-Jun-OO 


986 


IPARP\bnh_process.m 


25-Jul-00 


2,629 


IPARP\bnh_run_infer.m 


25-Jul-00 


1,510 


IPARP\bnh_train.m 


6-Mar-01 


4,508 


IPARP\bnh_train2.m 


30-May-00 


862 


IPARP\bnhm_infer.m 


12-Jun-OO 


1,942 


IPARP\bnhm_process.m 


12-Jun-OO 


620 


IPARP\bnn.m 


19-Oct-00 


3,597 


IPARP\bnn_act.m 


6-Mar-01 


12,792 


IPARP\bnn_act_b.m 


6-Mar-01 


10,754 


IPARP\bnn_act_hpcm 


19-Oct-00 


9,687 


IPARP\bnn_actg.m 


20-Feb-01 


4,037 


IPARP\bnn_dlg.m 


20-Feb-01 


3,947 


IPARP\bnn_dlgg.m 


20-Feb-01 


3,146 


IPARP\bnn_dlgs.m 


23-Oct-00 


4,491 


IPARP\bnng_body.m 


6-Mar-01 


8,058 


IPARP\BNT_ui.m 


25-Jul-00 


2,234 


IPARP\bpn.m 


19-Oct-00 


1,973 


IPARP\bpn_act.m 


19-Oct-00 


10,325 


IPARP\bpn_dlg.m 


19-May-99 


2,295 


IPARP\brn.m 


18-May-Ol 


913 


IPARP\brn_act.m 


28-Mar-01 


12,659 


IPARP\brn_dlg.m 


28-Mar-01 


3,773 


IPARP\bm_pr_act.m 


28-Mar-01 


7,700 


IPARP\bm_pr_dlg.m 


28-Mar-01 


3,758 


IPARP\brnr.m 


28-Mar-01 


541 


IPARP\cartPredict.m 


21-Feb-01 


1,154 


IPARP\cdd.m 


25-Jan-01 


445 


IPARP\cddd.m 


25-Jan-01 


737 


IPARP\cell2num.m 


l-Nov-99 


447 


IPARP\celldisp.m 


15-May-OO 


1,378 


IPARP\celldisp2.m 


15-May-00 


1,469 


IPARP\class_ftise.m 


6-Mar-01 


2,314 


IPARP\class_partition.m 


19-Oct-00 


3,369 
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IPARP\cluster_merge.m 


9-M-98 


1,032 


IPARP\cluster_test.m 


26-Oct-00 


1,449 


IPARP\cmsort.m 


12-May-98 


2,843 


IPARP\coh.m 


2-Apr-01 


1,778 


IPARP\compare_CR.m 


10-Jun-99 


1,704 


IPARP\compJ.m 


19-Oct-00 


2,081 


IPARP\compJM.m 


19-Oct-00 


2,961 


IPARP\compLL.m 


19-Oct-00 


1,915 


IPARP\comp0.m 


12-Jul-99 


1,376 


IPARP\cont_disc.m 


27-Mar-01 


297 


IPARP\cont_or_disc.m 


27-Mar-01 


297 


IPARPVContents.m 


9-Dec-99 


2,945 


IPARP\corr.m 


2-Apr-01 


2,460 


IPARP\corrld.m 


14-Aug-00 


1,091 


IPARP\CPDdisp.m 


l-Jun-00 


1,176 


IPARP\cpdf.m 


12-Jul-99 


1,241 


IPARP\CPDh_disp.m 


2-Jun-00 


1,337 


IPARP\CPTdisp.m 


l-Jun-00 


1,729 


IPARP\crlb_body.m 


9-Jul-98 


4,963 


IPARP\ctb_histc.m 


16-Apr-01 


2,424 


IPARP\dann_act.m 


26-Jul-01 


12,671 


IPARP\dann_actg.m 


26-Jul-01 


3,786 


IPARP\dann_dlg.m 


26-Jul-01 


3,552 


IPARP\dann_dlgg.m 


26-Jul-01 


2,758 


IPARP\danng_body.m 


26-Jul-01 


8,054 


IPARP\datgen.m 


19-Oct-00 


1,742 


IPARP\dbnd_run_infer.m 


22-Jun-00 


1,511 


IPARP\decode.m 


23-Jun-94 


853 


IPARP\derivs.m 


2-May-97 


410 


IPARP\determine_data_type.m 


2-May-01 


869 


IPARP\disc_disc_assoc.m 


7-Mar-01 


381 


IPARP\disp_field_name.m 


2-May-01 


336 


IPARP\disp_tree.m 


2-Feb-01 


1,332 


IPARP\display_data_misc.m 


16-Oct-00 


1,804 


IPARP\display_rank.m 


26-Feb-01 


282 


IPARP\diverg.m 


19-Oct-00 


2,573 


IPARP\dlmhdrload.m 


22-Jan-01 


1,420 


IPARP\dmult.m 


2-May-97 


123 


IPARPVdoCPD.m 


25-Jul-00 


1,590 


IPARP\doCPDh.m 


25-Jul-00 


2,190 


IPARP\done_tgt.m 


7-Mar-01 


1,649 


IPARP\dyadic.m 


20-Mar-01 


202 
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IPARP\em_act.m 


19-Oct-00 


r 1 1 1 

5,111 


IPARP\em_dlg.m 


l-Sep-99 


a on 

2,817 


IPARP\em_new_dlg.m 


1 ^ T,,1 C\C\ 

12-Jul-99 


l,OZO 


IPARP\em_vq.m 


12-Jul-99 


2,328 


IPARP\embed.m 


21-Dec-00 


1,557 


IPARP\embed_sm.m 


l-Mar-01 


323 


IPARP\embed_smooth.m 


24-Jul-01 


205 


IPARP\entropy.m 


7-Mar-01 


263 


IPARP\epic_act.m 


l-Sep-99 


3,436 


IPARP\epic_eval.m 


l-Sep-99 


3,193 


IPARP\epwic_act.m 


l-Sep-99 


3,157 


IPARP\epwic_act2.m 


l-Sep-99 


3,527 


IPARP\epwic_eval.m 


l-Sep-99 


3,185 


IPARP\est_mean_freq.m 


20-Apr-0l 


367 


IPARP\exl_act.m 


6-Mar-0l 


1,341 


IPARP\exl_getmat.m 


l-Nov-99 


2,681 


IPARP\exl_setmat.m 


l-Nov-99 


4,084 


IPARP\fact.m 


l2-Jul-99 


1,296 


IPARP\fdr.m 


19-Oct-OO 


2,243 


IPARP\fdrc.m 


16-Apr-Ol 


845 


IPARP\fe_add_dir.m 


2-May-0l 


765 


IPARP\fe_pred_act.m 


19-Oct-OO 


3,887 


IPARP\fe_pred_anal.m 


19-Oct-OO 


2,558 


IPARP\fe_pred_anal2.m 


19-Oct-OO 


4,225 


IPARP\fe_pred_dlg.m 


23-Mar-01 


4,846 


IPARP\feat_gen.m 


19-Oct-OO 


3,163 


IPARP\featcorr.m 


19-Oct-OO 


4,260 


IPARP\featgen.m 


19-Oct-OO 


3,646 


IPARP\fec_class.m 


12-Jan-01 


3,472 


IPARP\fext_act.m 


7-May-01 


2,572 


IPARP\fext_dlg.m 


3-May-Ol 


3,001 


IPARP\ff_ext.m 


22-Dec-00 


1,878 


IPARP\ff_ext2.m 


21-Dec-00 


2,529 


IPARP\filesize.m 


12-Jul-99 


1,048 


IPARP\fill_act.m 


23-Jan-01 


3,382 


IPARP\fill_act_mm.m 


2-Jan-01 


3,093 


IPARP\find_absent.m 


12-Jul-99 


1,302 


IPARP\find_enc.m 


12-Jul-99 


1,501 


EPARP\fmd_harmonic.m 


30-Apr-Ol 


715 


IPARP\find_mono_rep.m 


19-Mar-01 


694 


IPARP\fmd_neighbor.m 


12-Jul-99 


1,077 


IPARP\fmdkil.m 


8-Dec-00 


175 





IPARP\fmdm.m 




IPARP\findms.m 




IPARPVFmdmu.m 




IPARP\fmdmu2.m 




IPARP\fmdtab.m 




IPARPVfiro.m 




IPARP\fm_norm.m 




IPARP\forward.m 




IPARP\freq„tracker.m 




IPARP\fukunaga.m 




IPARP\fiikusep.m 




IPARP\fuse_bag.m 




IPARP\fuse_boost.m 




IPARP\fuse_fec.m 


-r? 


IPARP\fuse_stack.m 




IPARP\fusion_dlg.m 


'"^ 


IPARP\fusion_dlgg,m 


■« i s « 


IPARP\ga_fam 


; s 


IPARP\ga__reduce.m 




IPARP\gen_act.m 




IPARP\gen_cont_data.m 




IPARP\gen_disc_data.m 




IPARP\genJiybrid_data.m 




IPARP\genJiybrid_data2.m 


1 1 1 


IPARP\gen_time_series.m 




IPARP\gendemo.m 




IPARP\generate_clas_pdf.m 




IPARP\generate_cmat.m 




IPARP\genetic.m 




IPARP\genplotm 




IPARP\glm_actm 




IPARP\glm_act_b.m 




IPARP\glm_act_hpc.m 




IPARP\glm_actg.m 




IPARP\glm_dlg.m 




IPARP\glm_dlgg.m 




IPARP\glm_dlgs.m 




IPARP\glmgJ)ody.m 




IPARP\glmm.m 




IPARP\gmm_act.m 




IPARP\gmm_act_b.m 




IPARP\gmm_act_hpc.m 



PATENT 
7648/81349 
00SC105.111 



15-Jul-99 


1,947 


15-Jul-99 


1,629 


12-Jul-99 


1,272 


15-Jul-99 


1,502 


22-Jan-01 


280 


12-Jul-99 


1,688 


12-Jul-99 


1,575 


17-Oct-00 


1,184 


2-May-01 


768 


15-Jan-01 


566 


19-Oct-00 


1,861 


6-Mar-01 


2,411 


6-Mar-01 


1,852 


6-Mar-01 


3,364 


6-Mar-01 


2,146 


19-Oct-00 


33,649 


8-Jan-01 


2,732 


19-Dec-00 


385 


27-Feb-01 


1,536 


20-Dec-00 


2,817 


31-May-00 


1,220 


14-Jun-00 


72 


l-Jun-00 


623 


25-Jul-00 


690 


21-Feb-01 


35 


23-Jun-94 


7,442 


28-Mar-01 


1,476 


27-Mar-01 


541 


19-Dec-OO 


8,390 


23-Jun-94 


932 


6-Mar-01 


12,705 


6-Mar-01 


10,711 


19-Oct-00 


9,650 


20-Feb-01 


3,910 


l-Sep-99 


3,514 


20-Feb-01 


2,728 


23-Oct-00 


4,062 


6-Mar-01 


8,062 


19-Oct-00 


2,879 


6-Mar-01 


13,129 


6-Mar-01 


10,976 


19-Oct-00 


9,813 
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IPARP\gmm_actg.m 

IPARP\gmm_dlg.m 

IPARP\gmm_dlgg.m 

IPARP\gmm_dlgs.m 

IPARP\gmmg_body.m 

IPARP\gmmm.m 

IPARP\group_partition,m 

IPARP\henon.m 

IPARP\hist_unique.m 

IPARP\hist2.m 

IPARP\hmm.m 

IPARP\hmm_act.m 

IPARP\hmm_cl.m 

IPARP\hmm_dlg.m 

IPARP\hmmk.m 

IPARP\hough.m 

IPARP\hspc_cmatm 

IPARP\hspc_cmat2.m 

IPARP\hspcl.m 

IPARPMexplore.m 

IPARP\index_sub.m 

IPARPXiparp.m 

IPARP\isalpha.m 

IPARPVisnum.m 

IPARP\jointPD.m 

IPARPVjointPDc.m 

IPARP\k_means_dlg.m 

IPARP\km_act.m 

IPARP\km_eclass.m 

IPARP\km_new_dlg.m 

IPARP\knn_act.m 

IPARP\knn_act_b.m 

IPARP\knn_act_hpcm 

IPARP\knn_actg.m 

IPARP\knn_dlg.m 

IPARP\knn_dlgg.m 

IPARP\knn_dlgs.m 

IPARP\knng_body.m 

IPARP\knnk.m 

IPARP\knnm.m 

IPARP\kread.m 

IPARP\kread„exceLm 
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20-Feb-01 


3,998 


l-Sep-99 


3,862 


20-Feb-01 


3,090 


23-Oct-00 


4,406 


6-Mar-01 


8,062 


18-May-01 


3,301 


7-May-01 


581 


12-Jul-99 


1,586 


8-Dec-OO 


234 


2-Apr-01 


2,190 


12-Jul-99 


3,474 


19-Oct-00 


8,659 


12-Jul-99 


1,521 


2-Apr-01 


2,899 


12-Jul-99 


3,087 


10-Jun-99 


4,173 


19-Oct-00 


1,734 


19-Oct-00 


1,735 


23-Oct-00 


3,657 


13-Oct-00 


1,726 


13-Jul-99 


1,499 


26-Jul-01 


16,483 


13-Jul-01 


336 


25-Apr-01 


110 


16-May-00 


252 


31-May-00 


209 


26-Oct-00 


2,913 


26-Oct-00 


5,680 


19-Oct-00 


1,396 


13-Jul-99 


1,672 


6-Mar-01 


13,618 


6-Mar-01 


10,609 


19-Oct-00 


9,550 


19-Oct-00 


3,960 


4-Sep-99 


3,500 


12-Oct-00 


2,729 


23-Oct-00 


4,048 


6-Mar-01 


9,032 


19-Oct-00 


2,203 


22-May-01 


2,515 


13-Jul-99 


1,303 


20-Dec-00 


1,021 
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IPARP\ks_excel.m 

IPARP\kwrite.m 

IPARPMfcm 

IPARP\lfc_act.m 
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IPARP\LLR_integrator.m 
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IPARP\mlp_act.m 
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IPARP\mlp_act_hpcm 

IPARP\mlp_actg.m 

IPARP\mlp_dlg.m 

IPARP\mlp_dlgg.m 

IPARP\mlp_dlgs.m 

IPARP\mlp_pr_act.m 

IPARP\mlp_pr_dlg.m 

IPARP\mlpg_body.m 

IPARP\mlpm.m 

IPARPVmlprm.m 

IPARPXmlreg.m 

IPARP\mlreg_pr_act.m 

IPARP\mlreg_pr_dlg.m 

IPARP\mlregr.m 

IPARP\moe_pr_act.m 

IPARP\moe_pr_dlg.m 

IPARP\moerm.m 

IPARP\mom.m 

IPARP\mssk.m 

IPARP\mutate.m 

IPARP\mutual_info.m 

IPARPWg.m 

IPARP\mvg_act.m 
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13-Jul-99 


3,541 
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699 


16-Jan-01 


2,921 
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IPARP\nlt_toggle.m 
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IPARP\nnc.m 

IPARP\nnc_actm 

IP ARP\nnc_act_b m 
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IPARP\nnc_actg.m 

IPARP\nnc_dlg.m 

IPARP\nnc_dlgg.m 

IPARP\nnc_dlgs.m 

IPARP\nncg_body.m 

IPARPVnormal.m 

IPARP\normal_b.m 

IPARP\normr2,m 

IPARP\num2pop.m 

IPARP\open„access.m 

IPARP\open_data.m 

IPARP\open_excel.m 

IPARP\open_excel2.m 
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IPARP\open_ssdir.m 
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6-Mar-01 


11,503 


19-Oct-00 


10,444 


7-May-01 


3,980 


2-Sep-99 


3,507 


7-May-01 


3,046 


23-Oct-00 


4,042 


19-Dec-00 


173 


7-May-01 


8,788 


6-Mar-0 1 


8,120 


25-Jan-01 


1,792 


6-Mar-01 


13,196 


20-Feb-01 


4,233 


15-Jan-01 


4,084 


20-Feb-01 


3,041 


15-Jul-99 


1,768 


2-Apr-01 


1,782 


17-Jan-01 


1,347 


15-Dec-00 
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9-Jul-Ol 


6,783 


13-Jul-99 


1,816 


6-Mar-01 


12,617 


6-Mar-01 


10,644 
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9,585 


20-Feb-01 


3,934 


2-Sep-99 


3,289 


20-Feb-01 


2,503 


23-Oct-OO 


3,819 


6-Mar-01 


8,984 


ll-Apr-01 


2,288 
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1,251 


28-Mar-01 


172 


26-Feb-01 
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25-Apr-01 


1,782 


12-Jun-00 


262 


19-Oct-00 


1,685 


24-Oct-00 
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25-Oct-00 


884 


12-Jun-00 


308 


23-Mar-01 


2,874 


l-May-01 


1,419 


15-Mar-01 


1,504 


7-May-01 


3,715 
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IPARP\open2.m 

IPARP\openc.m 

IPARP\openrl.m 

IPARP\openr2.m 

IPARP\opent.m 

IPARP\opentJxt.m 

IPARP\organize_unk_dat.m 

IPARP\ortho.m 

IPARP\ortho_3d.m 

IPARP\orthotemp.m 

IPARP\outlier_removal.m 

IPARP\output_tree.m 

IPARP\part_boot.m 

IPARP\part_random.m 

IPARP\part_stratify.m 

IPARP\partfb.m 

IPARP\partfbr.m 

IPARP\partition.m 

IPARP\partran.m 

IPARPVpartranr.m 

IPARP\partt_random.m 

IPARP\peak_interp.m 

IPARP\plot_candle.m 

IPARP\plot_indi.m 

IPARP\plot_MD.m 

IPARP\plot_pdf.m 

IPARP\plot_time.m 

IPARP\plot41d.m 
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IPARP\prai_act_b.m 
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IPARP\pnn_dlgg.m 
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1,210 
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2,398 
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520 
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2,122 
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1,827 
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12,603 
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10,516 
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9,457 
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3,756 


2-Sep-99 


3,515 
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2,728 
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4,061 
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8,077 


2-Mar-01 


2,948 


28-Mar-95 


1,228 


26-Feb-01 
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IPARP\pred_dlg.m 

IPARP\prep_discretize.m 

IPARP\prep_outlier.m 

IPARP\prep_represent.m 

IPARP\prepare_affy_data.m 

IPARP\prepare_data.m 

IPARP\Prob.m 

IPARP\process_fh.m 

IPARP\profit_calc.m 

IPARP\prune.m 
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IPARP\prune_det_coeff.m 
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IPARP\prune_tree.m 
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3,963 


20-Feb-01 


2,949 
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IPARP\rbf_pr_dlg.m 

IPARP\rbfg_body.m 

IPARP\rbfm.m 

IPARPVbfrm.m 

IPARP\read_affy.m 

IPARP\read_ascii.m 

IPARP\read_txtm 

IPARP\read_txt2.m 

IPARP\recompr.m 

IPARP\Regr.m 

IPARP\regression_datgen.m 

IPARP\removems.m 

IPARPVreproduc.m 

IPARP\rest_skm.m 

IPARP\rocho.m 

IPARP\rtree.m 

IPARPVrugplotm 
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IPARP\run_fixsion.m 
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IPARP\save_netm 

IPARP\savefea.m 
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IPARP\show_cont.m 

IPARP\show_dis.m 
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IPARP\SS_anaLm 
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IPARP\SS_plotm 


12-Apr-01 
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IPARP\SSS_anal,m 


10-Nov-00 


2,099 


IPARP\SSS_plot.m 


19-Oct-00 
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IPARP\SSufficientMain.m 


l-Mar-01 
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IPARP\SSufficientStatm 


6-Mar-01 


2,667 


IPARP\str2num_mult.m 


26-Jul-01 


212 


IPARP\str2pop.m 


16-Jun-01 


403 


IPARP\strh2strv.m 


15-Mar-01 


184 


IPARP\strinsert.m 


17-Jan-01 


496 


IPARP\SufficientMain.m 


ll-Apr-01 


281 


IPARP\SufFicientStat.m 


12-Apr-01 
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IPARP\svd_te.m 


l-Jun-01 
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27-Mar-01 
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9-Feb-01 


534 


IPARP\test rocm 


17-Oct-00 
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IPARP\test stress m 


l-Mav-01 


3,998 


TPARP\tesfpen m 
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IPARP\threearb m 

XX J. 1JLV1 ill XX VvUl \J till 


l-Sep-99 


2,420 


IPARP\trivial_know.m 


23-Apr-01 


312 


IPARP\trn.m 


19-Dec-00 


13,000 


IPARP\TS_fe.m 


23-Mar-01 


1,587 


IPARP\TS_feat_ext.m 


23-Mar-01 


1,302 


IPARP\TS_norm_plot.m 


20-Mar-01 


602 


IPARP\TS_xform.m 


27-Mar-01 


2,648 


IPARP\tst.m 


19-Dec-00 


4,397 


IPARP\twoDmom.m 


19-Oct-00 


2,216 
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IPARP\uniquek.m 14-Jul-99 1,258 

IPARP\USASI.M ll-Dec-00 1,671 

IPARP\view3d.m 28-Jun-99 13,442 

IPARP\vq.m 9-Jul-98 1,043 

IPARP\vqi.c.m 26-Oct-00 12,199 

IPARP\waterfall_k.m 20-Apr-01 331 

IPARP\wav_fe.m 25-Apr-01 3,134 

IPARP\xover.m 23-Jun-94 703 

IPARP\ZEROTRIM.M 12-May-98 1,259 

IPARP\MART\addResiduals.c 26-Jul-01 21,359 

IPARP\MART\addResiduals_mex.c 26-Jul-01 1,233 

IPARP\MART\addResidualsC.c 19-Feb-01 3,755 

IPARP\MART\martEval.c 26-Jul-01 8,231 

IPARP\MART\martEval_mex.c 26-Jul-01 5,693 

IPARP\MART\martEvalC.c 21-Feb-01 5,010 

IPARP\MAR1\nominalSplitC.c 20-Feb-01 3,842 

IPARP\MART\nominalSplitCjmex.c 26-Jul-01 1,378 

IPARP\MARl\nominalSplitC_mex_interface.c 26-Jul-01 5,361 

IPARP\MARl\numericSplitC.c 20-Feb-01 2,597 

IPARP\MART\setValuesFromResiduals.c 26-Jul-01 12,710 

IPARP\MART\setValuesFromResiduals_mex.c 26-Jul-01 3,947 

IPARP\MART\setValuesFromResidualsC.c 19-Feb-01 3,772 

IPARP\MART\addResiduals.h 26-Jul-01 1,142 

IPARP\MART\martEval.h 26-Jul-01 966 

IPARP\MARl\martEvalC_mex_interface.h 26-Jul-01 1,175 

IPARP\MART\median.h 26-Jul-01 874 

IPAPJ > \MART\nominalSplitC_mex_interface.h 26-Jul-01 1,300 

IPARP\MART\setValuesFromResiduals.h 26-Jul-01 1,224 

IPARP\MART\addResiduals.m 16-Feb-01 1,070 

IPARP\MART\averageNodeOuq)ut.m 21-Feb-01 272 

IPARP\MART\cartPredictm 21-Feb-01 1,154 

IPARP\MART\kread.m 13-Jul-99 1,303 

IP ARP\M ARTvmattm 22-Mar-01 2,305 

IPARP\MART\mart2.m 21-May-01 2,260 

IPARP\MART\martAccuracy.m 5-Mar-01 656 

IPARP\MART\martEval.m 5-Mar-01 811 

IP ARP\MARTVmartPredict.m 5-Mar-01 624 

IPARP\MART\martr.m 3-Apr-01 2,299 

IP ARP\MART\martTrain.m 26-Jul-01 6,368 

IPARP\MART\partition.m 12-Feb-01 947 

IPARP\MART\rtree.m 22-Mar-01 5,848 

IPARP\MART\setValuesFromResiduals.m 5-Mar-01 630 
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IPARP\MART\simpleRTree.m 

IPARP\MART\test_mart.m 

IPARP\MART\README.txt 

IPTvChangeLog 
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IPT\makefile,v 

IPT\passwd 

IPTvperms 
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IPT\README,v 
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IPT\access.log.001 

IPTtfismysql.OOl 

IPT\access.log.002 

IPT\access.log.003 

IPT\access.log.004 

IPTAhosts.allow 

IPT\_ISDEL.EXE 

IPT\convert_image.exe 

IPTdptalg.exe 

IPT\nsd.exe 

IPT\SETUP.EXE 

IPT\string_escape.exe 

IPT\unzip.exe 

IPT\zip.exe 

IPT\_SETUP.DLL 

IPT\getHTTP.dll 

IPT\libmySQL.dll 

IPTvnscgi.dll 

IPT\nscp.dll 
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IPT\nslog.dll 

IPT\nsmysql.dll 

IPT\nsperm.dll 

IPT\nssock.dll 

IPT\nsssle.dll 

IPTVnstcl.dll 

IPT\nsthread.dll 

IPT\LAYOUT.BIN 

IPT\logo.bmp 
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IPT\plain logo.bmp 


28-Aug-003 


,693,882 


IP1\SETUP.BMP 


12-Feb-98 


86,878 


IPT\cfar.c 


22-Sep-00 
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IPT\convert_image.c 


23-Sep-00 


6,454 


IPT\detectc 


10-Jul-01 


16,282 


IPTVdispatcher.c 
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[0005] A portion of the disclosure of this patent document contains material which is 

subject to copyright protection. The copyright owner has no objection to the facsimile reproduction 
by any one of the patent document or the patent disclosure, as it appears in the Patent and Trademark 
Office patent file or records, but otherwise reserves all copyright rights whatsoever. 

BACKGROUND 

[0006] This invention relates generally to a data processing apparatus and corresponding 

methods for the analysis of data stored in a database or as computer files and more particularly to a 
method for selecting appropriate algorithms based on data characteristics such as, for example, 
digital signal processing ("DSP") and image processing ("IP"). 
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[0007] As bandwidth becomes more plentiful, data mining must be able to handle spatially 

and temporally sampled data, such as image and time-series data, respectively. DSP and IP 
algorithms transform raw time-series and image data into projection spaces, where good features can 
be extracted for data mining. The universe of the algorithm space is so vast that it is virtually 
impossible to try out every algorithm in an exhaustive fashion. 

[0008] DSP relates generally to time series data. Time series data may be recorded by any 

conventional means, including, but not limited to, physical observation and data entry, or electronic 
sensors connected directly to a computer. One example of such time series data would be sonar 
readings taken over a period of time. A further example of such time series data would be financial 
data. Such financial data may typically be reported in conventional sources on a daily basis or may 
be continuously updated on a tick-by-tick basis. A number for algorithms are known for processing 
various types of time-series digital signal data in data mining applications. 

[0009] IP relates generally to data representing a visual image. Image data may relate to a 

still photograph or the like, which has no temporal dimension and thus does not fall within the 
definition of digital signal time series data as customarily understood. In another embodiment, 
image data may also have a time series dimension such as in a moving picture or other series of 
images. One example of such a series of images would be mammograms taken over a period of 
time, where radiologists or other such users may desire to detect significant changes in the image. In 
general, an objective of IP algorithms is to maximize, as compactly as possible, useful information 
content concerning regions of interest in spatial, chromatic, or other applicable dimensions of the 
digital image data. A number of algorithms are known for processing various types of image data. 
Under certain situations, spatial sensor data require preprocessing to convert sensor time-series data 
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into images. Examples of such spatial sensor data include radar, sonar, infrared, laser, and others. 
Examples of such preprocessing include synthetic -aperture processing and beam forming. 
[0010] Currently known data-mining tools lack a generalized capability to process sampled 

data. Instead, techniques in the areas of DSP and IP explore specific approaches developed for 
different application areas. For example, some techniques explore a combination of autoregressive 
moving average time-series modeling (also known as linear predictive coding ("LPC") in the speech 
community for the autoregressive portion) and a neural-network approach for econometric data 
analysis. As a further example, one commercially available economic data-mining application relies 
on vector autoregressive moving average with exogenous input for econometric time-series analysis. 
Other known techniques appear similar to sonar multi-resolution signal detectors, and may use a 
combination of the fast Fourier transform and Yule-Walker LPC analyses for time-series modeling 
of physiological polygraphic data, or propose a time-series pattern-matching system that relies on 
frame-based, geometric shape matching given training templates. Yule-Walker LPC is a standard 
technique in estimating autoregressive coefficients in, for example, speech coding. It uses time- 
series data rearranged in the form of a Toelpitz data matrix. 

[0011] Still other known approaches, for example, use geometric and/or spectral features to 

find similar patterns in time-series data, or suggest a suite of processing algorithms for object 
classification, without the benefit of automatic algorithm selection. Known approaches, for 
example, describe an integrated approach to surface anomaly detection using various algorithms 
including IP algorithms. All these approaches explore a small subset in the gigantic universe of 
processing algorithms based on intuition and experience. 
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[0012] In difficult data-mining problems, the bulk of performance gain may be attributable 

to judicious preprocessing and feature extraction, not to the backend data mining. Because the search 
space of such preprocessing algorithms is comparatively extremely large, global optimization based 
on an exhaustive search is virtually impossible. Locally optimal solutions tend to be ad hoc and 
cover only a limited algorithm-search space depending on the level of algorithmic expertise of the 
user. These approaches do not take advantage of a prior performance database and differences in the 
level of algorithm complexity to allow rapid convergence to a globally optimal solution in selecting 
appropriate algorithms such as signal- and image-processing algorithms. Because of the 
aforementioned complexity, many data-mining tools neither provide guidance on how to process 
temporally and spatially sampled data nor are capable of processing sampled data. One embodiment 
disclosed herein automatically selects an appropriate set of DSP and IP algorithms based on problem 
context and data characteristics. 

[0013] In general, known approaches provide specific algorithms dealing with special 

application areas. Some, for example, relate to algorithms that may be useful in analyzing 
physiological data. Others relate to algorithms that may be useful in analyzing econometric data. 
Still others relate to algorithms that may be useful in analyzing geometric data. Each of these 
approaches therefore explores a comparatively small subset of the algorithm space. 
[0014] Known data mining tools lack a general capability to process sampled data without 

a priori knowledge about the problem domain. Even with prior knowledge about the problem 
domain, preprocessing can often be done only by algorithm experts. Such experts must write their 
own computer programs to convert sampled data into a set of feature vectors, which can then be 
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processed by a data mining tool The above described and other approaches in the areas of DSP and 
IP explore specific approaches developed for different application areas by algorithm experts. 
[0015] A disadvantage of such approaches is that developing highly tailored DSP and IP 

algorithms for each application domain is painstakingly tedious and time consuming. Because such 
approaches are painstakingly tedious and time consuming, most developers looking for algorithms 
explore only a small subset of the algorithm universe. Exploring only a small subset of the 
algorithm universe may result in sub-optimal performance. Furthermore, the requirement for such 
algorithm expertise may prevents users from extracting the highest level of knowledge from their 
data in a cost-efficient manner. 

[0016] There remains a need, therefore, for a solution that will, in at least some 

embodiments, automatically select appropriate algorithms based on the problem data set supplied 
and convert raw data into a set of features that can be mined. 

SUMMARY 

[0017] The invention, together with the advantages thereof, may be understood by 

reference to the following description in conjunction with the accompanying figures, which illustrate 
some embodiments of the invention. 

[0018] One embodiment is a method to identify a preprocessing algorithm for raw data. 

This method may include providing an algorithm knowledge database with preprocessing algorithm 
data and feature set data associated with the preprocessing algorithm data, analyzing raw data to 
produce analyzed data, extracting from the analyzed data features that characterize the data, and 
selecting a preprocessing algorithm using the algorithm knowledge database and features extracted 
from the analyzed data. The raw data may be DSP data or IP data. DSP data may be analyzed using 
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TFR-space transformation, phase map representation, and/or detection/clustering. IP data may be 
analyzed using detection/segmentation and/or ROI shape characterization. The method may also 
include data preparation and/or evaluating the selected preprocessing algorithm. Data preparation 
may include conditioning/preprocessing, Constant False Alarm Rate ("CFAR") processing, and/or 
adaptive integration. Conditioning/preprocessing may include interpolation, transformation, 
normalization, hardlimiting outliers, and/or softlimiting outliers. The method may also include 
updating the algorithm knowledge base after evaluating the selected preprocessing algorithm. 
[0019] Another embodiment is a data mining system for identifying a preprocessing 

algorithm for raw data. The data mining system includes (i) at least one memory containing an 
algorithm knowledge database and raw data for processing and (ii) random access memory with a 
computer program stored in it. The random access memory is coupled to the other memory so that 
the random access memory is adapted to receive (a) a data analysis program to analyze raw data, (b) 
a feature extraction program to extract features from raw data, and (c) an algorithm selection 
program to identify a preprocessing algorithm. It is not necessary that the algorithm knowledge 
database and the raw data exist simultaneously on just one memory. In an alternative embodiment, 
the algorithm knowledge database and the raw data for processing may be contained in and spread 
across a plurality of memories. These memories may be any type of memory known in the art 
including, but not limited to, hard disks, magnetic tape, punched paper, a floppy diskette, a CD- 
ROM, a DVD-ROM, RAM memory, a remote site accessible by any known protocall, or any other 
memory device for storing data. The data analysis program may include a DSP data analysis 
program and/or an IP data analysis program. The DSP data analysis program may be able to 
perform TFR-space transformation, phase map representation, and/or detection/clustering. The IP 
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data analysis program may be able to perform detection/segmentation and/or ROI shape 
characterization. The random access memory may also receive a data preparation subprogram 
and/or an algorithm evaluation subprogram. The data preparation program may include a 
conditioning/preprocessing subprogram, a CFAR processing subprogram, and/or an adaptive 
integration subprogram. The conditioning/preprocessing subprogram may includes interpolation, 
transformation, normalization, hardlimiting outliers, and/or softlimiting outliers. The algorithm 
evaluation program may update the algorithm knowledge database contained in the memory. 
[0020] Another embodiment is a data mining application that includes (a) an algorithm 

knowledge database containing preprocessing algorithm data and feature set data associated with the 
preprocessing algorithm data; (b) a data analysis module adapted to receive control of the data 
mining application when the data mining application begins; (c) a feature extraction module adapted 
to receive control of the data mining application from the data analysis module and available to 
identify a set of features; and (d) an algorithm selection module available to receive control from the 
feature extraction module and available to identify a preprocessing algorithm based upon the set of 
features identified by the feature extraction module using the algorithm knowledge database. The 
algorithm selection module may select a DSP algorithm and/or an IP algorithm. The algorithm 
selection module may use energy compaction capabilities, discrimination capabilities, and/or 
correlation capabilities. The data analysis module may use a short-time Fourier transform coupled 
with LPC analysis, a compressed phase-map representation, and/or a detection/clustering process if 
the data selection process will select a DSP algorithm. The data analysis module may use a 
procedure operable to provide at least one a ROI by segmentation, a procedure to extract local shape 
related features from a ROI; a procedure to extract two-dimensional wavelet features characterizing 
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a ROI; and/or a procedure to extract global features characterizing all ROIs if the algorithm selection 
module will select an IP algorithm. The detection/clustering process may be an expectation 
maximization algorithm or may include procedures that set a hit detection threshold, identify phase- 
space map tiles, count hits in each identified phase-space map tile, and detect the phase-space map 
tiles for which the hits counted exceeds the hit detection threshold. The data mining application may 
also include an advanced feature extraction module available to receive control from the algorithm 
selection module and to identify more features for inclusion in the set of features. It may also 
include a data preparation module available to receive control after the data mining application 
begins, in which case the data analysis module is available to receive control from the data 
preparation module. It may also include an algorithm evaluation module that evaluates performance 
of the preprocessing algorithm identified by the algorithm selection module and which may update 
the algorithm knowledge database. The data preparation module may include a 
conditioning/preprocessing process, a CFAR processing process and/or an adaptive integration 
process. The conditioning/preprocessing process may perform interpolation, transformation, 
normalization, hardlimiting outliers, and/or softlimiting outliers. Adaptive integration may include 
subspace filtering and/or kernel smoothing. 

[0021] Another embodiment is a data mining product embedded in a computer readable 

medium. This embodiment includes at least one computer readable medium with an algorithm 
knowledge database embedded in it and with computer readable program code embedded in it to 
identify a preprocessing algorithm for raw data. The computer readable program code in the data 
mining product includes computer readable program code for data analysis to produce analyzed data 
from the raw data, computer readable program code for feature extraction to identify a feature set 
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from the analyzed data, and computer readable program code for algorithm selection to identify a 
preprocessing algorithm using the analyzed data and the algorithm knowledge database. The 
computer readable program code may also include computer readable program code for algorithm 
evaluation to evaluate the preprocessing algorithm selected by the computer readable program code 
for algorithm selection. The data mining product need not be contained on a single article of media 
and may be embedded in a plurality of computer readable media. The computer readable program 
code for data analysis may include computer readable program code for DSP data analysis and/or 
computer readable program code for IP data analysis. The computer readable program code for DSP 
data analysis may include computer readable program code for TFR-space transformation, computer 
readable program code for phase map representation and/or computer readable program code for 
detection/clustering. The computer readable program code for IP data analysis may include 
computer readable program code for detection/segmentation and/or computer readable program code 
for ROI shape characterization. The computer readable program code for algorithm evaluation may 
be operable to modify the algorithm knowledge database. The data mining product may also include 
computer readable program code for data preparation to produce prepared data from the raw data, in 
which the computer readable program code for data analysis operates on the raw data after it has 
been transformed into the prepared data. The computer readable program code for data preparation 
may include computer readable program code for conditioning/preprocessing, computer readable 
program code for CFAR processing, and/or computer readable program code for adaptive 
integration. The computer readable program code for conditioning/preprocessing may include 
computer readable program code for interpolation, computer readable program code for 
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transformation, computer readable program code for normalization, computer readable program code 
for hardlimiting outliers, and/or computer readable program code for softlimiting outliers. 

REFERENCE TO THE DRAWINGS 
[0022] Several features of the present invention are further described in connection with the 

accompanying drawings in which: 

[0023] FIG. 1 is a program flowchart that generally depicts the sequence of operations in 

an exemplary program for automatic mapping of raw data to a processing algorithm. 
[0024] FIG. 2 is a data flowchart that generally depicts the path of data and the processing 

steps for an example of a process for automatic mapping of raw data to a processing algorithm. 
[0025] FIG. 3 is a system flowchart that generally depicts the flow of operations and data 

flow of one embodiment of a system for automatic mapping of raw data to a processing algorithm. 
[0026] FIG. 4 is a program flowchart that generally depicts the sequence of operations in 

an exemplary program for data preparation. 

[0027] FIG. 5 is a program flowchart that generally depicts the sequence of operations in 

an example of a program for data conditioning/preprocessing. 

[0028] FIG. 6 is a block diagram that generally depicts a configuration of one embodiment 

of hardware suitable for automatic mapping of raw data to a processing algorithm. 

[0029] FIG. 7 is a program flowchart that generally depicts the sequence of operations in 

one example of a program for automatic mapping of DSP data to a processing algorithm. 

[0030] FIG. 8 is a data flowchart that generally depicts the path of data and the processing 

steps for one embodiment of automatic mapping of DSP data to a processing algorithm. 
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[0031] FIG. 9 is a system flowchart that generally depicts the flow of operations and data 

flow of a system for one embodiment of automatic mapping of DSP data to a processing algorithm. 

[0032] FIG. 10 is a program flowchart that generally depicts the sequence of operations in 

an exemplary program for automatic mapping of image data to a processing algorithm. 

[0033] FIG. 1 1 is a data flowchart that generally depicts the path of data and the processing 

steps for one embodiment of automatic mapping of image data to a processing algorithm. 

[0034] FIG. 12 is a system flowchart that generally depicts the flow of operations and data 

flow of one embodiment of a system for automatic mapping of image data to a processing algorithm. 

DETAILED DESCRIPTIONS OF EXEMPLARY EMBODIMENTS 
[0035] While the present invention is susceptible of embodiment in various forms, there is 

shown in the drawings and will hereinafter be described some exemplary and non-limiting 
embodiments, with the understanding that the present disclosure is to be considered an 
exemplification of the invention and is not intended to limit the invention to the specific 
embodiments illustrated. 

[0036] In one embodiment, a data mining system and method selects appropriate digital 

signal processing ("DSP") and image processing ("IP") algorithms based on data characteristics. 
One embodiment identifies preprocessing algorithms based on data characteristics regardless of 
application areas. Another embodiment quantifies algorithm effectiveness using discrimination, 
correlation and energy compaction measures to update continuously a knowledge database that 
improves algorithm performance over time. The embodiments may be combined in one combination 
embodiment. 
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[0037] In another embodiment, there is provided for time-series data a set of candidate DSP 

algorithms. The nature of a query posed regarding the time-series data will define a problem 
domain. Examples of such problem domains include demand forecasting, prediction, profitability 
analysis, dynamic customer relationship management (CRM), and others. As a function of problem 
domain and data characteristics, the number of acceptable DSP algorithms is reduced. DSP 
algorithms selected from this reduced set may be used to extract features that will succinctly 
summarize the underlying sampled data. The algorithm evaluates the effectiveness of each DSP 
algorithm in terms of how compactly it captures information present in raw data and how much 
separation the derived features provide in terms of differentiating different outcomes of the 
dependent variable. The same logic may be applied to IP. While the concept of class separation has 
been generally applied to classification (categorical processing), it is nonetheless applicable to 
prediction and regression because continuous outputs can be converted to discrete variables for 
approximate reasoning using the concept of class separation. In an embodiment where the 
dependent variable remains continuous, the more appropriate performance measure will be 
correlation, not discrimination. 

[0038] In another embodiment, raw time-series and image input data can be processed 

through low-complexity signal-processing and image-processing algorithms in order to extract 
representative features. The low-complexity features assist in characterizing the underlying data in a 
computationally inexpensive manner. The low-complexity features may then be ranked based on 
their importance. The effective low-complexity features will then be a subset including low 
complexity features of high ranking and importance. There is provided a performance database 
containing a historical record indicating how well various image- and signal-processing algorithms 
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perfonned on various types of data. Feature association next occurs in order to identify high- 
complexity features that have worked well consistently with the effective low-complexity features 
previously computed. Next, there are identified high-complexity signal- and image-processing 
algorithms from which the associated high-complexity features were extracted. Then the identified 
high-complexity algorithms are used in preprocessing to improve data-mining performance further 
iteratively. This procedure can work on an arbitrary level of granularity in algorithm complexity. 
[0039] An embodiment may initially perform computationally efficient processing in order 

to extract a set of features that characterizes the underlying macro and micro trends in data. These 
features provide much insight into the type of appropriate processing algorithms regardless of 
application areas and algorithm complexity. Thus, the data mining application in one embodiment 
may be freed of the requirement of any prior knowledge regarding the nature of the problem set 
domain. 

[0040] An example of one aspect of data mining operations that may be automated by one 

embodiment of the invention is automatic recommendation of advanced DSP and IP algorithms by 
finding a meaningful relationship between signal/image characteristics and appropriate processing 
algorithms from a performance database As a further example, another aspect of data mining 
operations that may be automated by one embodiment of the invention is DSP-based and/or IP- 
based preprocessing tools that automatically summarize information embedded in raw time-series 
and image data and quantify the effectiveness of each algorithm based on a combined measure of 
energy compaction and class separation or correlation. 

[0041] One embodiment the invention disclosed and claimed herein may be used, for 

example, as part of a complete data mining solution usable in solving more advanced applications. 
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One example of such an advanced application would be seismic data analysis. A further example of 
such an advanced application would be sonar, radar, IR, or LIDAR sensor data processing. 
[0042] One embodiment of this invention characterizes data using a feature vector and 

helps the user find a small number of appropriate DSP and IP algorithms for feature extraction. 
[0043] An embodiment of the invention comprises a data mining application with 

improved high-complexity preprocessing algorithm selection, the data mining application 
comprising an algorithm knowledge database including preprocessing algorithm data and feature set 
data associated with the preprocessing algorithm data; a data analysis module that is available to 
receive control after the data mining application begins; a feature extraction module that is available 
to receive control from the data analysis module and that is available to identify a set of features; and 
an algorithm selection module that is available to receive control from the feature extraction module 
and that is available to identify a preprocessing algorithm based upon the set of features identified by 
the feature extraction module using the algorithm knowledge database. The algorithm selection 
module may select a DSP algorithm using energy compaction, discrimination, and/or correlation 
capabilities. The data analysis module may use a short-time Fourier transform, a compressed phase- 
map representation, and/or a detection/clustering process. The detection/clustering process can 
include procedures that for setting a hit detection threshold, identifying phase-space map tiles, 
counting hits in each identified phase-space map tile, and/or detecting the phase-space map tiles for 
which the number of hits counted exceeds the hit detection threshold using an expectation 
maximization algorithm. The algorithm selection module may select an IP algorithm using energy 
compaction, discrimination, and/or correlation capabilities to select an IP algorithm. The data 
analysis module for an IP algorithm may comprise a procedure to provide at least one a region of 
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interest by segmentation and at least one procedure selected from the set of procedures including: a 
procedure to extract local shape related features from a region of interest; a procedure to extract two- 
dimensional wavelet features characterizing a region of interest; and a procedure to extract global 
features characterizing all regions of interest. The data mining application may also include an 
advanced feature extraction module available to receive control from the algorithm selection module 
and to identify more features for inclusion in the set of features and/or a data preparation module that 
is available to receive control after the data mining application begins, wherein the data analysis 
module is available to receive control from the data preparation module. The data analysis module 
may include conditioning/preprocessing, interpolation, transformation, and normalization. The 
conditioning/preprocessing process may perform adaptive integration. The data preparation module 
may include a CFAR processing process to identify and extract long term trend lines and adaptive 
integration, including subspace filtering and kernel smoothing. The data mining application may 
also include an algorithm evaluation module that evaluates performance of the preprocessing 
algorithm identified by the algorithm selection module and updates the algorithm knowledge 
database. 

[0044] Referring now to FIG. 1, there is illustrated a flowchart of an exemplary 

embodiment of a raw data mapping program (100) to map raw data automatically to an advanced 
preprocessing algorithm, which depicts the sequence of operations to map raw data automatically to 
an advanced preprocessing algorithm. When it begins, the raw data mapping program (100) initially 
calls a data preparation process (1 10). The data preparation process (1 10) can perform simple 
functions to prepare data for more sophisticated DSP or IP algorithms. Examples of the kinds of 
simple functions performed by the data preparation process (1 10) may include 
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conditioning/preprocessing, constant false alarm rate ("CFAR") processing, or adaptive integration. 
Some may perform wavelet-based multi-resolution analysis as part of preprocessing. In speech 
processing, preprocessing may include speech/non-speech separation. Speech/non-speech separation 
in essence uses LPC and spectral features to eliminate non-speech regions. Non-speech regions may 
include, for example, phone ringing, machinery noise, etc. Highly domain-specific algorithms can 
be added later as part of feature extraction and data mining. 

[0045] Referring still to the example illustrated in FIG. 1, when the data preparation 

process (110) completes, it calls a data analysis process (120). In one embodiment, for DSP data, 
the data analysis process (120) can perform functions such as time frequency representation space 
("TFR-space") transformation, phase map representation, and detection/clustering. Certain 
embodiments of processes to perform these exemplary functions for DSP data are further described 
below in connection with FIG. 7. In another embodiment, for IP data the data analysis process (120) 
can perform functions such as detection/segmentation and region of interest ("ROI") shape 
characterization. Certain embodiments of processes to perform these exemplary functions for IP 
data are further described below in connection with FIG. 10. 

[0046] Referring still to the illustrated embodiment in FIG. 1, when the data analysis 

process (120) completes, it calls a feature extraction process (130). The feature extraction process 
(130) extracts features that characterize the underlying data and may be useful to select an 
appropriate preprocessing algorithm. For example, an embodiment of the feature extraction process 
(130) may operate to identify features in DSP data such as a sinusoidal event or exponentially 
damped sinusoids or significant inflection points or anomalous events or predefined spatio-temporal 
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patterns in a template database. Another embodiment of the feature extraction process (130) may 
operate to identify features in IP data such as shape, texture, and intensity. 
[0047] As shown in FIG. 1, when the feature extraction process (130) of the illustrated 

example completes, it calls an algorithm selection process (140). The actual selection is based on a 
knowledge database that keeps track of which algorithms work best given the global-feature 
distribution and local-feature distribution. Global feature distribution concerns the distribution of 
features over an entire event or all events, whereas local feature distribution concerns the distribution 
of features from frame to frame or tick to tick, as in speech recognition. The objective function for 
the algorithm selection process (140) is based on how well features derived from each algorithm 
achieve energy compaction and discriminate among or correlate with output classes. The actual 
algorithm selection process (140) for algorithm selection based on the local and global features may 
perform using any of the known solution methods . For example, the algorithm selection process 
(140) may be based on a family of hierarchical pruning classifiers. Hierarchical pruning classifiers 
operate by continuous optimization of confusing hypercubes in the feature vector space sequentially. 
Instead of giving up after the first attempt at classification, a set of hierarchical sequential pruning 
classifiers can be created. The first-stage feature-classifier combination can operate on the original 
data set to the extent possible. Next, the regions with high overlap are identified as "confusing" 
hypercubes in a multi-dimensional feature space. The second-stage feature-classifier combination 
can then be designed by optimizing parameters over the surviving feature tokens in the confusing 
hypercubes. At this stage, easily separable feature tokens have been discarded from the original 
feature set. These steps can be repeated until a desired performance is met or the number of 
surviving feature tokens falls below a preset threshold. 
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[0048] Referring to the embodiment of FIG. 1, when the algorithm selection process (140) 

completes it calls an algorithm evaluation process (150) as shown. The data used by the algorithm 
selection process (140) are continuously updated by self-critiquing the selections made. Each 
algorithm may be evaluated based on any suitable measure for evaluating the selection including, for 
example, energy compaction and discrimination or correlation capabilities. 
[0049 ] Energy compaction criterion measures how well the signal-energy spread over 

multiple time samples can be captured in a small number of transform coefficients. Energy 
compaction may be measured by computing the amount of energy being captured by transform 
coefficients as a function of the number of transform coefficients. For instance, a transform 
algorithm that captures 90% of energy with the top three transform coefficients in time-series 
samples is superior to another transform algorithm that captures 70% of energy with the top three 
coefficients. Energy compaction is measured for each transform algorithm, which generates a set of 
transform coefficients. For instance, the Fourier transform has a family of sinusoidal basis 
functions, which transform time-series data into a set of frequency coefficients (i.e., transform 
coefficients). The less the number of transform coefficients with large magnitudes, the more energy 
compaction a transform algorithm achieves. Discrimination criteria assess the ability of features 
derived from each algorithm to differentiate target classes. Discrimination measures the ability of 
features derived from a transform algorithm to differentiate different target outcomes. In general, 
discrimination and energy compaction can go hand in hand based purely on probability arguments. 
Nevertheless, it may be desirable to combine the two in assessing the efficacy of a transform 
algorithm in data mining. Discrimination is directly proportional to how well an input feature 
separates various target outcomes. For a two-class problem, for example, discrimination is measured 
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by calculating the level of overlap between the two class-conditional feature probability density 
functions. Correlation criteria evaluate the ability of features to track the continuous target variable 
with an arbitrary amount of time lag. After completing the algorithm evaluation process (150), the 
exemplary program illustrated in FIG. 1 may end, as shown. 

[0050] Referring next to FIG. 2, there is disclosed a data flowchart that generally depicts 

the path of data and the processing steps for an example of a process (200) for automatic mapping of 
raw data to a processing algorithm. As shown, the process (200) begins with raw data (210), in 
whatever form. Raw data may be found in an existing database, or may be collected through 
automated monitoring equipment, or may be keyed in by manual data entry. Raw data can be in the 
form of Binary Large Objects (BLOBs) or one-to-many fields in the context of object-relational 
database. In other instances, raw data can be stored in a file structure. Highly normalized table 
structures in an object-oriented database may store such raw data in an efficient structure. Raw data 
examples include, but are not limited to, mammogram image data, daily sales data, macroeconomic 
data (such as the consumer confidence index, Economic Cycle Research Institute index, and others) 
as a function of time, and so on. The specific form and media of the data are not material to this 
invention. It is expected that it may be desirable to put the raw data (210) in a machine readable and 
accessible form by some suitable process. 

[0051] Referring still to the exemplary process (200) illustrated in FIG. 2, the raw data 

(210) flows to and is operated on by the data preparation process (1 10). Examples of the kinds of 
simple functions performed by the data preparation process (110) may include 
conditioning/preprocessing, CFAR processing, or adaptive integration. After the raw data (210) are 
subjected to these various functions or any of them, the result is a set of prepared data (220). The 
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prepared data (220) flows to and is operated on by the data analysis process (120). In an 
embodiment in which the prepared data (220) is DSP data, the data analysis process (120) may 
perform the functions of TFR-space transformation, phase map representation, and 
detection/clustering, examples of which are further described in the embodiment depicted in FIG. 7. 
In another embodiment in which the prepared data (220) is IP data, the data analysis process (120) 
may perform the functions of detection/segmentation and ROI shape characterization, examples of 
which are further described in the embodiment depicted in FIG. 10. The result is that prepared data 
(220), whether DSP data or IP data, is transformed into analyzed data (230) which is descriptive of 
the characteristics of the prepared data (220). 

[0052] In the example process (200) illustrated in FIG. 2, the analyzed data (230) flows to 

and is operated on by the feature extraction process (130), which extracts local and global features. 
For example, in an embodiment that operates on raw data (210) that is DSP data, the feature 
extraction process (130) may characterize the time-frequency distribution and phase-map space. As 
another example, in an embodiment that operates on raw data (210) that is IP data, the feature 
extraction process (130) may characterize features such as texture, shape, and intensity. The result 
in the illustrated embodiment will be feature set data (240) containing information that characterizes 
the raw data (210) as transformed into prepared data (220) and analyzed data (230). 
[0053] Referring still to the example of FIG. 2, feature set data (240) flows to and is 

operated on by the algorithm selection process (140), which in the illustrated embodiment performs 
its processing using information stored in an existing algorithm knowledge database (260). The 
actual algorithm knowledge database (260) in this example may be based on how each algorithm 
contributes to energy compaction and discrimination in classification or correlation in regression. 



PATENT 
7648/81349 
00SC105,111 

The algorithm knowledge database (260) may be filled based on experiences with knowledge 
extraction from various time-series and image data. The algorithm selection process (140) identifies 
processing algorithms (250). These processing algorithms (250) then flow to and are operated upon 
by the algorithm evaluation process (150), which in turn updates the algorithm knowledge database 
(260) as illustrated by line 261. The final output of the program is, first, the processing algorithms 
(250) that will be used by a data mining application to analyze data and, second, an updated 
algorithm knowledge database (260), that will be used for future mapping of raw data (210) to 
processing algorithms (250) 

[0054] Referring next to FIG. 3, there is shown a system flowchart that generally depicts 

the flow of operations and data flow of an embodiment of a system (300) for automatic mapping of 
raw data to a processing algorithm. This FIG. 3 depicts not only data flow, but also control flow 
between processes for the illustrated embodiments. The individual data symbols, indicating the 
existence of data, and process symbols, indicating the operations to be performed on data, are 
described further in connection with FIG. 1 above and FIG. 2 above. When it begins, this example 
process (300) initially calls a data preparation process (110). The data preparation process (110) 
operates on raw data (210) to produce prepared data (220), then when it is finished calls the data 
analysis process (120). The data analysis process (120) operates on prepared data (220) to produce 
analyzed data (230), then when it is finished calls the feature extraction process (130). The feature 
extraction process (130) operates on analyzed data (230) to produce feature set data (240), then when 
it is finished calls the algorithm selection process (140). The algorithm selection process (140) uses 
the algorithm knowledge database (260) and operates on the feature set data (240) to identify 
processing algorithms (250), then when it is finished calls the algorithm evaluation process (150). 
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The algorithm evaluation process (150) evaluates the identified processing algorithms (250), then 
uses the results of its evaluation to update the algorithm knowledge database (260) in the 
embodiment illustrated in FIG. 3. In another embodiment (not shown) an algorithm knowledge 
database may be predetermined and not updated. After the algorithm evaluation process (150) 
completes, the program may end. 

[0055] Referring next to FIG. 4, there is disclosed a program flowchart depicting a specific 

example of a suitable data preparation process (1 10). This data preparation process (1 10) performs a 
series of preferably computationally inexpensive operations to render data more suitable for 
processing by other algorithms in order better to identify data mining preprocessing algorithms. 
Before using relatively more sophisticated DSP or IP algorithms, it may be advantageous first to 
process the raw time series or image data through relatively low complexity DSP and IP algorithms. 
The relatively low complexity DSP and IP algorithms may assist in extracting representative 
features. These low complexity features may also assist in characterizing the underlying data. One 
benefit of an embodiment of this invention including such relatively low-complexity preprocessing 
algorithms is that this approach to characterizing the underlying data is relatively inexpensive 
computationally. 

[0056] When the embodiment of the data preparation process (110) illustrated in FIG. 4 

begins, it calls first a conditioning/preprocessing process (410). The conditioning/preprocessing 
process (110) may perform various functions including interpolation/decimation, transformation, 
normalization, and hardlimiting or softlimiting outliers. These functions of the 
conditioning/preprocessing process (410) may serve to fill in missing values and provide for more 
meaningful processing. 
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[0057] Referring still to the example of FIG. 4, when the data preparation process (110) 

ends, it calls a constant false alarm-rate ("CFAR") processing process (420), which may operate to 
eliminate long term trend lines and seasonal fluctuations. The CFAR processing process (420) may 
further operate to accentuate sharp deviations from recent norm. When long term trend lines are 
eliminated and sharp deviations from recent norms are accentuated, later processing algorithms can 
focus more accurately and precisely on transient events of high significance that may mark the onset 
of a major trend reversal. In an embodiment including a CFAR processing process (420), long term 
trends may be annotated as up or down with slope to eliminate long term trend lines while 
emphasizing sharp deviations from recent norms. One example of CFAR processing involves the 
following three steps: (1) estimation of local noise statistics around the test token, (2) elimination of 
outliers from the calculation of local noise statistics, and (3) normalization of the test token by the 
estimated local noise statistics. The output data is a normalized version of the input data. 
[0058] The constant-false-alarm-rate processing process (420) may identify critical points 

in the data. Such a critical point may reflect, for example, an inflection point in the variable to be 
predicted. As a further example, such a critical point may correspond to a transient event in the 
observed data. In general, the signals comprising data indicating these critical points may be 
interspersed with noise comprising other data corresponding to random fluctuations. It may be 
desirable to improve the signal-to-noise ratio in the data set through an additional processing step. 
[0059] Because the CFAR processing process (420) tends to amplify small perturbations in 

data, the effect of small, random fluctuations may be exaggerated. It may therefore be desirable in 
some embodiments to reduce the sensitivity of the processing to fluctuations reflected in only one or 
a similarly comparatively very small number of observations. Referring still to the embodiment 
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illustrated in FIG. 4, when the CFAR processing process (420) ends, it calls an adaptive integration 
process (430) to improve the signal-to-noise ratio of inflection or transient events. The adaptive 
integration process (430) may, for example, perform subspace filtering to separate data into signal 
and alternative subspaces. The adaptive integration process (430) may also perform smoothing, for 
example, Viterbi line integration and/or kernel smoothing, so that the detection process is not overly 
sensitive to small, tick-by-tick fluctuations. Adaptive integration may perform trend-dependent 
integration and is particularly useful in tracking time-varying frequency line structures such as may 
occur in speech and sonar processing. It can keep track of line trends over time and hypothesize 
where the new lines should continue, thereby adjusting integration over energy and space 
accordingly. Typical integration cannot accommodate such dynamic behaviors in data structure. 
Subspace filtering utilizes the singular value decomposition to divide data into signal subspace and 
alternate (noise) subspace. This filtering allows focus on the data structure responsible for the signal 
component. Kernel smoothing uses a kernel function to perform interpolation around a test token. 
The smoothing results can be summed over multiple test tokens so that the overall probability 
density function is considerably smoother than the one derived from a simple histogram by hit 
counting. 

[0060] Referring now to FIG 5, there is disclosed a program flowchart depicting an 

example of a process that may be performed as part of the conditioning/preprocessing process (410). 
In one embodiment, when the conditioning/preprocessing process (410) begins, it first calls an 
interpolation process (510). Interpolation can be linear, quadratic, or highly nonlinear (quadratic is 
nonlinear) through transformation. An example of such nonlinear transformation is Stolt 
interpolation in synthetic -aperture radar with spotlight processing. In general, the nearest N samples 
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to the time point desired to be estimated are found and interpolation or oversampling is used to fill-in 
the missing time sample. The interpolation process (510) may be used in the conditioning module to 
fill in missing values and to align samples in time if sampling intervals differ. When the 
interpolation process (510) ends, it calls a transformation process (520), which transforms data from 
one space into another. Transformation may encompassfor example, difference output, scaling, 
nonlinear mathematical transformation, composite-index generation based on multiple channel data. 
[0061] The transformation process (520) may then call a normalization process (530) for 

more meaningful processing. For example, in an embodiment analyzing financial data, the financial 
data may be transformed by the transformation process (520) and normalized by the normalization 
process (530) for more meaningful interpretation of macro trends not biased by short-term 
fluctuations, demographics, and inflation. Transformation and normalization do not have to occur 
together, but they generally complement each other. Normalization eliminates long-term trends (and 
may therefore be useful in dealing with non-stationary noise) and accentuates momentum-changing 
events, while transformation maps input data samples in the input space to transform coefficients in 
the transform space. Normalization can detrend data to eliminate long-term easily predictable 
patterns. For instance, the stock market may tend to increase in the long term. Some may be 
interested in inflection points, which can be accentuated with normalization. Transformation maps 
data from one space to another. When the normalization process (530) ends control in the example 
of FIG. 5 may then flow to a hard^imitmg/softlimiting outliers process (540). 
[0062 ] The hardlimiting/softlimiting outliers process (540) may act to confine observations 

within certain boundaries so as to restrict exaggerated effects from isolated, extreme observations by 
clipping or transformation. Outliers are defined as those that are far different from the norm. They 
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can be identified in terms of Euclidean distance. That is, if a distance between the centroid and a 
scalar or vector test token normalized by variance for scalar or covariance matrix for vector 
attributes exceeds a certain threshold, then the test token is labeled as an outlier and can be thrown 
out or replaced. Replacing all the outliers with the same value is hardlimiting, while softlimiting 
assigns a much smaller dynamic range in mapping the outliers to a set of numbers (i.e., hyperbolic 
tangent, sigmoid, log, etc.). A standard set of parameters will be provided for novice users, while 
expert users can change their values. When the hardlimiting/softlimiting outliers process (540) 
concludes, the illustrated conditioning/preprocessing process (410) ends. It is not necessary that 
each of these processes be performed for conditioning/preprocessing, nor is it required that they be 
performed in this specific order. For example, in another embodiment of the 
conditioning/preprocessing process (410), the interpolation/decimation process (510) or any of the 
other processes (520) (530) (540) may be omitted. In still another embodiment of the conditioning 
preprocessing process (410), the hardlimiting/softlimiting outliers process (540) may be called first 
rather than last. Other sequences and combinations are possible, and are considered to be equivalent 
to the specific embodiments here described, as are all other low complexity 
conditioning/preprocessing algorithms now know or hereafter developed. 

[0063] Referring now to FIG. 6, there is disclosed a block diagram that generally depicts an 

example of a configuration (600) of hardware suitable for automatic mapping of raw data to a 
processing algorithm. A general-purpose digital computer (601) includes a hard disk (640), a hard 
disk controller (645), ram storage (650), an optional cache (660), a processor (670), a clock (680), 
and various I/O channels (690). In one embodiment, the hard disk (640) will store data mining 
application software, raw data for data mining, and an algorithm knowledge database. Many 



PATENT 
7648/81349 
00SC105,111 

different types of storage devices may be used and are considered equivalent to the hard disk (640), 
including but not limited to a floppy disk, a CD-ROM, a DVD-ROM, an online web site, tape 
storage, and compact flash storage. In other embodiments not shown, some or all of these units may 
be stored, accessed, or used off-site, as, for example, by an internet connection. The I/O channels 
(690) are communications channels whereby information is transmitted between RAM storage and 
the storage devices such as the hard disk (640). The general-purpose digital computer (601) may 
also include peripheral devices such as, for example, a keyboard (610), a display (620), or a printer 
(630) for providing run-time interaction and/or receiving results. Prototype software has been tested 
on Windows 2000 and Unix workstations. It is currently written in Matlab and C/C++. Two 
embodiments are currently envisioned — client server and browser-enabled. Both versions will 
communicate with the back-end relational database servers through ODBC (Object Database 
Connectivity) using a pool of persistent database connections. 

[0064] Referring now to FIG. 7, there is disclosed a program flowchart of an exemplary 

embodiment of a DSP data mapping program (700). When the DSP data mapping program begins it 
calls a data preparation process (110) to perform simple functions such as 
conditioning/preprocessing, CFAR processing, or adaptive integration. This data preparation 
process may fill, smooth, transform, and normalize DSP data. When tie data preparation process 
(1 10) has completed, it calls a DSP data analysis process (720). This illustrated DSP data analysis 
process (720) is one embodiment of a general data analysis process (120) described above in 
connection with FIG. 1. 

[0065] TFR-space relates generally to the spectral distribution of how significant events 

occur over time. The DSP data analysis process (720) may include a TFR-space transformation sub- 



PATENT 
7648/81349 
00SC105,111 

process (724) activated as part of the DSP data analysis process (720). In one embodiment of the 
DSP data mapping program (700), the TFR-space transformation sub-process (724) may use the 
short-time Fourier transform ("STFT"). An advantage of the STFT (in those embodiments using the 
STFT) is that it is more computationally efficient than other more elaborate time-frequency 
representation algorithms. The STFT applies the Fourier transform to each frame. The entire time- 
series data is divided into multiple overlapping time frames, where each frame spans a small subset 
of the entire data. Each time frame is converted into transform coefficients. Essentially, an N-point 
time series is mapped onto an M-by-(N*2/M-l) matrix (with 50% overlap between the two 
consecutive time frames), where M is the number of time samples in each frame. For instance, a 
1024-point time series can be converted into a 64-by-3 1 TFR matrix with 50% overlap and 64-point 
FFT (M = 64). On the other hand, LPC analysis can reduce 64-FFT coefficients to a much smaller 
set for even greater compression if the input data exhibit harmonic frequency structures. Other TFR 
functions include quadratic functions such as Wigner-Ville, Reduced Interference Distribution, 
Choi-Williams Distribution, and others. Still other TFR functions include a highly nonlinear TFR 
such as Ensemble Interval Histogram. 

[0066] Referring still to the embodiment of FIG. 7, the DSP data analysis process (720) 

may include a phase map representation sub-process (722). Phase map representation relates 
generally to the occurrence over time of similar events. The phase-map representation sub-process 
(722) may be effective to detect the presence of low dimensionality in non-linear data and to 
characterize the nature of local signal dynamics, as well as helping identify temporal relationships 
between inputs and outputs. The phase map representation sub-process (722) may be activated as 
soon as the DSP data analysis process (720) begins, and in general need not await completion of the 
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TFR-space transformation sub-process (724). We can generate a phase map by dividing time-series 
data into a set of highly overlapping frames (similar to the TFR-space transformation). Instead of 
applying frequency transformation as in the TFR, we simply create an embedded data matrix, where 
each column holds either raw samples or principal components of the frame data. The resulting 
structure again is a matrix. Each column vector spans a phase-map vector space, in which we can 
trace trajectories of the system dynamical behavior over time. 

[0067] Referring still to the embodiment illustrated in FIG. 7, when the TFR-space 

transformation sub-process (724) and the phase map representation sub-process (722) complete, they 
JJ may call a detection/clustering sub-process (726), which also operates on the preprocessed data of 
01 magnitude with respect to time. It may be desirable in an embodiment to calculate intensity in TFR 
S,j space. In an embodiment of the DSP data mining program (700) that includes the 
« detection/clustering sub-process (726), phase map-space may be divided into tiles. The number of 
t? hits per tile may then be tabulated by calculating how many of the observations fall within the 
W boundaries of each tile in phase-map space. Tiles for which the count exceeds a detection threshold 
may then be grouped spatially into clusters, thereby facilitating the compact description of tiles with 
the concept of fractal dimension. In one embodiment that detection threshold may be predetermined. 
In another embodiment that detection threshold may be computed dynamically based on the 
characteristics and performance of the data in the detection/clustering sub-process (726). In still 
another embodiment, phase-map space clustering may be based on an expectation-maximization 
algorithm. When the detection/clustering sub-process (726) ends, the DSP data analysis process 
(720) has finished. 
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[0068] Referring still to the exemplary embodiment illustrated in FIG. 7, when the DSP 

data analysis process (720) ends, it calls a DSP feature extraction process (730). The DSP feature 
extraction process (730) may perform functions to evaluate features of the time frequency 
representation. The actual distribution of clusters may provide insight into how significant events 
are distributed over time in a TFR space and when similar events occur in time in the phase map 
representation. Local features may be extracted from each cluster or frame and global features from 
the entire distribution of clusters. The local-feature set encompasses geometric shape-related 
features (for example, a horizontal line in the TFR space and a diagonal tile structure in the phase- 
map space would indicate a sinusoidal event), local dynamics estimated from the corresponding 
phase-map space, and LPC features from the corresponding time-series segment. The global-feature 
set may include the overall time-frequency distribution in TFR-space and the hidden Markov model 
that represents the cluster distribution in a phase map representation. 

[0069] In the embodiment of FIG. 7, when the DSP feature extraction process (730) ends it 

calls the DSP algorithm selection process (740). The DSP algorithm selection process (740) may 
select an appropriate subset of DSP algorithms from an algorithm library as a function of the local 
and global features. Actual selection may be based on a knowledge database that keeps track of 
which DSP algorithms work best given the global-feature and local-feature distribution. The 
objective function for selecting the best algorithm given the input features is based on how well 
features derived from each DSP transformation algorithm achieve energy compaction and 
discriminate output classes. For example, if the local features indicate the presence of a sinusoidal 
event as indicated by a long horizontal line in the TFR space, the Fourier transform may be the 
optimal choice. On the other hand, if the local features imply the presence of exponentially damped 
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sinusoids, the Gabor transform may be invoked. The Hough transform may be useful for identifying 
line-like structures of arbitrary orientation in images. A one-dimensional discrete cosine transform 
(DCT) is appropriate for identifying vertical or horizontal line-like structures (in particular, sonar 
grams in passive narrow-band processing) in images. Two-dimensional DCT or wavelets may be 
useful for identifying major trends. Viterbi algorithms may be useful for identifying wavy-line 
structures. Meta features may also be extracted that describe raw data, much like meta features that 
describe features, and that can shed insights into appropriate DSP and/or IP algorithms. 
[0070] Referring still to the embodiment of FIG. 7, when the DSP algorithm selection 

process ends it calls a DSP algorithm evaluation process (750). The DSP algorithm evaluation 
process (750) is one embodiment of the more general algorithm evaluation process (150) described 
above in reference to FIG. 1. The DSP algorithm evaluation process (750) evaluates the DSP 
algorithm selected by the DSP algorithm selection process (740). The DSP algorithm evaluation 
process (750) bases its evaluation on energy compaction and discrimination/correlation capabilities. 
The DSP algorithm evaluation process may also update a knowledge database used by the DSP 
algorithm selection process (740). When the DSP algorithm evaluation process (750) ends, the DSP 
data mapping program (700) has completed. 

[0071] Referring now to FIG. 8, there is disclosed a data flowchart that depicts generally 

the path of data and the processing steps for a specific example of automatic mapping of DSP data to 
a processing algorithm. The data begins in the form of raw DSP data (810), which is time-series 
data. This data may reside in an existing database, or may be collected using sensors, or may be 
keyed in by the user to capture it in a suitable machine-readable form. The raw DSP data (810) 
flows to and is operated on by the data preparation process (1 10), which may function to smooth, 
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fill, transform, and normalize the data resulting in prepared data (220). The prepared data (220) next 
flows to and is operated on by a DSP data analysis process (720). The DSP data analysis process 
(720) may perform the function of TFR-space transformation to produce TFR-space data (820). The 
DSP data analysis process (720) may also perform the function of phase map representation to 
produce phase-map representation data (830). The DSP data analysis process (720) may also use 
TFR-space data (820) and phase map representation data (830) to perform the function of 
detection/clustering to produce vector summarization data (840). In general, the output is 
summarized in a vector. In storm image analysis for example, each storm cell is summarized in a 
vector of spatial centroid, time stamp, shape statistics, intensity statistics, gradient, boundary, and so 
forth. The TFR-space data (820), phase map representation data (830), and vector summarization 
data (840) next flow to and are operated on by the DSP feature extraction process (730) to produce 
feature set data (240). The feature set data (240) next flows to and is operated on by the DSP 
algorithm selection process (740), which uses the knowledge database (260) to select a set of DSP 
algorithms that are then included in DSP algorithm set data (850). The DSP algorithm set data (850) 
next flows to and is operated on by the DSP algorithm evaluation process (750), which in turn 
updates the knowledge database (260). After selection of advanced DSP algorithms from the 
knowledge database, control passes to an advanced DSP feature extraction process (860) where 
advanced DSP features are extracted and appended to the original feature set. The final results are, 
first, the DSP algorithm set data (850), second, the updated knowledge database (260), and third the 
composite feature set derived from both basic and advanced DSP algorithms. 

[0072] Referring now to FIG. 9, there is shown a system flowchart that generally depicts 

the flow of operations and data flow of an example of a system for automatic mapping of DSP data 
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to a processing algorithm. The individual data symbols, indicating the existence of data, and process 
symbols, indicating the operations to be performed on data, are as described in connection with FIG. 
7 above and FIG. 8 above. When it begins, the program control initially passes to the data 
preparation process (110). This process operates on raw DSP data (810) to produce prepared data 
(220), then when it is finished passes control to the DSP data analysis process (720). The DSP data 
analysis process (720) operates on prepared data (220) to produce TFR-space data (820) phase map 
representation data (830) and vector histogram data (840), then when it is finished passes control to 
the DSP feature extraction process (730). The DSP feature extraction process (730) operates on 
TFR-space data (820), phase map representation data (830), and vector histogram data (840), to 
produce feature set data (240), then when it is finished passes control to the DSP algorithm selection 
process (740). The DSP algorithm selection process (740) uses the algorithm knowledge database 
(260) and operates on the feature set data (240) to produce DSP algorithm set data (850), then when 
it is finished passes control to the DSP algorithm evaluation process (750). The DSP algorithm 
evaluation process (750) evaluates the DSP algorithm set data (850), then uses the results of its 
evaluation to update the algorithm knowledge database (260). After the DSP algorithm evaluation 
process (750) completes, the program may end. 

[0073] Referring now to FIG. 10, there is disclosed a program flowchart of one 

embodiment of an IP data mapping program (1000). When the IP data mapping program begins 
control starts with a data preparation process (110) to perform simple functions such as 
conditioning/preprocessing, CFAR processing, or adaptive integration. This data preparation 
process (110) may fill, smooth, transform, and normalize DSP data. When the data preparation 
process (110) has completed, it calls an IP data analysis process (1020). This IP data analysis 
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process (1020) is one embodiment of a general data analysis process (120) described above in 
connection with FIG. 1 . 

[0074] Referring still to the embodiment of FIG. 10, the IP data analysis process (1020) 

may include a detection/segmentation sub-process (1023) and a region of interest ("ROI") shape 
characterization sub-process (1026). The detection/segmentation sub-process (1023) detects and 
segments the ROI. A detector first looks for certain intensity patterns such as bright pixels followed 
by dark ones in underwater imaging applications. After detection, any pixel that meets the detection 
criteria will be marked to be considered for segmentation. Next, spatially similar marked pixels are 
clustered to generate clusters to be processed later through feature extraction and data mining. The 
ROI shape characterization sub-process (1026) then identifies local shape-related and intensity- 
related characteristics of each ROI. In addition, the ROI shape characterization sub-process (1026) 
may identify two-dimensional wavelets to characterize texture. Two-dimensional wavelets divide an 
image in terms of frequency characteristics in both spatial dimensions. Shape-related features 
encompass statistics associated with edges, wavelet coefficients, and the level of symmetry. 
Intensity-related features may include mean, variance, skewness, kurtosis, gradient in radial 
directions from the centroid, and others. When the detection/segmentation sub-process (1023) and 
the ROI shape characterization sub-process (1026) complete, the IP data analysis process (1020) 
may also terminate. 

[0075] In the example of FIG. 10, when the IP data analysis process (1020) terminates, 

control passes to a ROI feature extraction process (1030). The ROI feature extraction process 
(1030) extracts global features from each image that characterizes the nature of all ROI snippets 
identified as clusters. The ROI feature extraction process (1030) also extracts local shape-related 
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features, intensity-related features, and other local features from each ROI. When the ROI feature 
extraction process (1030) terminates, control passes to an IP algorithm selection process (1040). The 
IP algorithm selection process (1040) selects an appropriate subset of IP algorithms from an 
algorithm library as a function of me local and global features. The actual selection is based on a 
knowledge database that keeps track of which IP algorithms work best given the global-feature and 
local-feature distribution. The objective function for selecting the best algorithm given the input 
features is based on how well features derived from each IP transformation algorithm achieve energy 
compaction and discriminate output classes. 

[0076] Referring still to the example of FIG. 10, when the IP algorithm selection process 

(1040) terminates, control passes to an IP algorithm evaluation process (1050). The IP algorithm 
evaluation process (1050) is an embodiment of the more general algorithm evaluation process (150) 
described above in reference to FIG. 1. The IP algorithm evaluation process (1050) evaluates the IP 
algorithm selected by the IP algorithm selection process (1040). The IP algorithm evaluation 
process (1050) of the illustrated embodiment bases its evaluation on energy compaction and 
discrimination capabilities. The IP algorithm evaluation process may also update a knowledge 
database used by the ISP algorithm selection process (1040). When the IP algorithm evaluation 
process (1050) ends, the IP data mapping program (1000) has completed. 

[0077] Referring now to FIG. 11, there is disclosed a data flowchart that generally depicts 

the path of data and the processing steps for a specific example of automatic mapping of IP data to 
an appropriate IP processing algorithm. The data begins in the form of raw IP data (1110). This 
data may reside in an existing database, or may be collected using spatial sensors, or may be keyed 
in by the user to capture it in a suitable machine-readable form. Under certain conditions, spatial 
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sensors such as radar, sonar, infrared, and the like will require some preliminary processing to 
convert time-series data into IP data. The raw IP data (1110) flows to and is operated on by the data 
preparation process (110), which may function to smooth, fill, transform, and normalize the data 
resulting in prepared data (220). The prepared data (220) next flows to and is operated on by an IP 
data analysis process (1020). 

[0078] The IP data analysis process (1020) in the embodiment of FIG. 1 1 may perform the 

functions detection/segmentation and ROI space characterization to produce segmented ROI with 
characterized shapes data (1 120). First, after preprocessing (cleaning and integration), all the pixels 
that are unusually bright or dark in comparison to the neighboring pixels are detected as a form of 
CFAR processing. Second, detected pixels are spatially clustered to segment each ROI. From each 
ROI, features are extracted to describe shape, intensity, texture, and gradient. The resulting data 
should be in the form of a matrix, where each column represents features associated with each 
detected cluster. The segmented ROI with characterized shapes data (1 120) next flows to and is 
operated on by the IP feature extraction process (730) to produce feature set data (240). The feature 
set data (240) next flows to and is operated on by the IP algorithm selection process (1040), which 
uses the knowledge database (260) to select a set of IP algorithms that are then included in IP 
algorithm set data (1 130). The IP algorithm set data (1 1 30) next flows to and is operated on by the 
IP algorithm evaluation process (1050), which in turn updates the knowledge database (260). The 
final results are, first, the IP algorithm set data (1 150) and, second, the updated knowledge database 
(260). 

[0079] Referring now to FIG. 12, there is shown a system flowchart that generally depicts 

the flow of operations and data flow of a specific example of a system for automatic mapping of raw 
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IP data (1 1 10) to IP algorithm set data (1 130) identifying relevant IP preprocessing algorithms. The 
individual data symbols, indicating the existence of data, and process symbols, indicating the 
operations to be performed on data, are as described in connection with FIG. 10 above and FIG. 11 
above. When it begins, the program control initially passes to the data preparation process (110). 
This process operates on raw IP data (1 1 10) to produce prepared data (220), then when it is finished 
passes control to the IP data analysis process (1020). The IP data analysis process (1020) operates 
on prepared data (220) to produce segmented ROI with characterized shapes data (1 120), then when 
it is finished passes control to the IP feature extraction process (1030). The IP feature extraction 
process (1030) operates on segmented ROI with characterized shapes data (1 120), to produce feature 
set data (240), then when it is finished passes control to the IP algorithm selection process (1040). 
The IP algorithm selection process (1040) uses the algorithm knowledge database (260) and operates 
on the feature set data (240) to produce IP algorithm set data (1 130), then when it is finished passes 
control to the IP algorithm evaluation process (1050). The IP algorithm evaluation process (1050) 
evaluates the IP algorithm set data (1050), and tfien uses the results of its evaluation to update the 
algorithm knowledge database (260). Moreover, advanced IP features are extracted to provide more 
accurate description of the underlying image data. The advanced IP features will be appended to the 
original feature set. After the IP algorithm evaluation process (1050) completes, the program may 
end. 

[0080] In one embodiment the particular processes described above may be made, used, 

sold, and otherwise practiced as articles of manufacture as one or more modules, each of which is a 
computer program in source code or object code and embodied in a computer readable medium. 
Such a medium may be, for example, floppy disks or CD-ROMS. Such an article of manufacture 
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may also be formed by installing software on a general purpose computer, whether installed from 
removable media such as a floppy disk or by means of a communication channel such as a network 
connection or by any other means. 

[0081] While the present invention has been described in the context of particular 

exemplary data structures, processes, and systems, those of ordinary skill in the art will appreciate 
that the processes of the present invention are capable of being distributed in the form of a computer 
readable medium of instructions and a variety of forms and that the present invention applies equally 
regardless of the particular type of signal bearing computer readable media actually used to carry out 
the distribution. Examples of computer readable media include recordable-type media such as floppy 
disc, a hard disk drive, a RAM, CD-ROMs, DVD-ROMs, an online internet web site, tape storage, 
and compact flash storage, and transmission-type media such as digital and analog communications 
links, and any other volatile or non-volatile mass storage system readable by the computer. The 
computer readable medium includes cooperating or interconnected computer readable media, which 
exist exclusively on single computer system or are distributed among multiple interconnected 
computer systems that may be local or remote. Those skilled in the art will also recognize many 
other configurations of these and similar components which can also comprise computer system, 
which are considered equivalent and are intended to be encompassed within the scope of the claims 
herein. 

[0082 ] Although embodiments have been shown and described, it is to be understood that 

various modifications and substitutions, as well as rearrangements of parts and components, can be 
made by those skilled in the art, without departing from the normal spirit and scope of this invention. 
Having thus described the invention in detail by way of reference to preferred embodiments thereof, 
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it will be apparent that other modifications and variations are possible without departing from the 
scope of the invention defined in the appended claims. Therefore, the spirit and scope of the 
appended claims should not be limited to the description of the embodiments contained herein. The 
appended claims are contemplated to cover the present invention and any and all modifications, 
variations, or equivalents that fall within the true spirit and scope of the basic underlying principles 
disclosed and claimed herein. 
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